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TECHNICAL FIELD 

The present invention relates generally to plant molecular biology. More 
20 specifically, it relates to nucleic acids and methods for modulating their expression 
in plants. 



BACKGROUND OF THE INVENTION 

Major advances in plant transformation have occurred over the last few 
25 years. However, in major crop plants, such as maize and soybeans, serious 

genotype limitations still exist. Transformation of agronomically important maize 
inbred lines continues to be both difficult and time consuming. Traditionally, the 
only way to elicit a culture response has been by optimizing medium components 
and/or explant material and source. This has led to success in some genotypes, 
30 but most elite hybrids fail to produce a favorable culture response. While, 
transformation of model genotypes is efficient, the process of introgressing 
transgenes into production inbreds is laborious, expensive and time consuming. It 



would save considerable time and money if genes could be introduced into and 
evaluated directly in commercial hybrids. 

Current methods for genetic engineering in maize require a specific cell 
type as the recipient of new DNA. These cells are found in relatively 
undifferentiated, rapidly growing callus cells or on the scutellar surface of the 
immature embryo (which gives rise to callus). Irrespective of the delivery method 
currently used, DNA is introduced into literally thousands of cells, yet 
transformants are recovered at frequencies of 10~ 5 relative to transiently- 
expressing cells. Exacerbating this problem, the trauma that accompanies DNA 
introduction directs recipient cells into cell cycle arrest and accumulating evidence 
suggests that many of these cells are directed into apoptosis or programmed cell 
death. (Reference Bowen et al, Third International Congress of the International 
Society for Plant Molecular Biology, 1991, Abstract 1093). Therefore it would be 
desirable to provide improved methods capable of increasing transformation 
efficiency in a number of cell types. 

Typically a selectable marker is used to recover transformed cells. 
Traditional selection schemes expose all cells to a phytotoxic agent and rely on 
the introduction of a resistance gene to recover transformants. Unfortunately, the 
presence of dying cells may reduce the efficiency of stable transformation. It 
would therefore be useful to provide a positive selection system for recovering 
transformants. 

In spite of increases in yield and harvested area worldwide, it is predicted 
that over the next ten years, meeting the demand for corn will require an 
additional 20% increase over current production (Dowswell, C.R., Paliwal, R.L., 
Cantrell, R.P., 1996, Maize in the Third World, Westview Press, Boulder, CO). 

In hybrid crops, including grains, oil seeds, forages, fruits and vegetables, 
there are problems associated with the development and production of hybrid 
seeds. The process of cross-pollination of plants is laborious and expensive. In 
the cross-pollination process, the female plant must be prevented from being 
fertilized by its own pollen. Many methods have been developed over the years, 
such as detasseling in the case of corn, developing and maintaining male sterile 
lines, and developing plants that are incompatible with their own pollen, to name a 



few. Since hybrids do not breed true, the process must be repeated for the 
production of every hybrid seed lot. 

To further complicate the process, inbred lines are crossed. For example 
in the case of corn, the inbreds can be low yielding. This provides a major 
challenge in the production of hybrid seed corn. In fact, certain hybrids cannot be 
commercialized at all due to the performance of the inbred lines. The production 
of hybrid seeds is consequently expensive, time consuming and provides known 
and unknown risks. It would therefore be valuable to develop new methods which 
contribute to the increase of production efficiency of hybrid seed. 

As new traits are added to commercial crops by means of genetic 
engineering, problems arise in "stacking" traits. In order to develop heritable 
stacked traits, the traits must be linked because of segregating populations. 
Improved methods for developing hybrid seed which would not require linking of 
the traits would significantly shorten the time for developing commercial hybrid 
seeds. 

Gene silencing is another problem in developing heritable traits with 
genetic engineering. Frequently gene silencing is seen following meiotic divisions. 
Elimination or reduction of this problem would advance the state of science and 
industry in this area. 

SUMMARY OF THE INVENTION 

It is the object of the present invention to provide nucleic acids and 
polypeptides relating to embryogenesis. 

It is another object of the present invention to provide nucleic acids and 
polypeptides that can be used to identify interacting proteins involved in 
transcription regulation in embryogenesis. 

It is another object of the present invention to provide antigenic fragments 
of the polypeptides of the present invention. 

It is another object of the present invention to provide transgenic plants and 
plant parts containing the nucleic acids of the present invention. 

It is another object of the present invention to provide methods for 
modulating, in a transgenic plant, the expression of the nucleic acids of the 
present invention. 
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It is another object of the present invention to provide a method for 
improving transformation frequencies. 

It is another object of the present invention to provide a method for 
improving transformation efficiency in cells from various sources. 

It is another object of the present invention to provide a method for a 
positive selection system. 

It is another object of the present invention to provide a method for 
efficiently producing hybrid seed via apomixis. 

It is another object of the present invention to provide a method for stacking 
traits which does not require linking of traits. 

It is another object of the present invention to provide a method for 
reducing the problem of gene silencing. 

The present invention relates to a HAP3-type CCAAT-box binding 
transcriptional activator polynucleotides and polypeptides, and in particular, the 
leafy cotyledon 1 transcriptional activator (LEC1) polynucleotides and 
polypeptides. In other aspects the present invention relates to expression 
cassettes optionally linked in antisense orientation, host cells transfected with at 
least one expression cassette, and transgenic plants and seeds comprising the 
expression cassettes. Further aspects of the invention include methods of using 
the polynucleotides and polypeptides. In a further aspect, the present invention 
relates to a method of modulating expression of the polynucleotides encoding the 
polypeptides of the present invention in a plant. Expression of the polynucleotides 
encoding the proteins of the present invention can be increased or decreased 
relative to a non-transformed control plant. 

BRIEF DESCRIPTION OF THE DRAWING 

Figure 1 depicts the comparison of various sequences and the alignment of 
the conserved regions. 

DETAILED DESCRIPTION OF THE INVENTION 

DEFINITIONS 

The term "isolated" refers to material, such as a nucleic acid or a protein, 

which is: (1) substantially or essentially free from components which normally 
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accompany or interact with the material as found in its naturally occurring 
environment or (2) if the material is in its natural environment, the material has 
been altered by deliberate human intervention to a composition and/or placed at a 
locus in the cell other than the locus native to the material. 

As used herein, "nucleic acid" means a polynucleotide and includes single 
or double-stranded polymer of deoxyribonucleotide or ribonucleotide bases. 
Nucleic acids may also include modified nucleotides that permit correct read 
through by a polymerase and do not alter the expression of a polypeptide 
encoded by the polynucleotide. 

As used herein, "LEC1 nucleic acid" means a nucleic acid or polynucleotide 
that codes for a LEC1 polypeptide. 

As used herein, "polypeptide" means proteins, protein fragments, modified 
proteins, amino acid sequences and synthetic amino acid sequences. The 
polypeptide can be glycosylated or not. 

As used herein, "LEC1 polypeptide" means a HAP3 family member, 
CCAAT-box binding transcriptional activator polypeptide that regulates gene 
expression during embryo development, and that contains the conserved 
sequence set out in SEQ ID NO: 23. 

As used herein, "plant" includes plants and plant parts including but not 
limited to plant cells, plant tissue such as leaves, stems, roots, flowers, and seeds. 

As used herein, "promoter" includes reference to a region of DNA upstream 
from the start of transcription and involved in recognition and binding of RNA 
polymerase and other proteins to initiate transcription. 

By "fragment" is intended a portion of the nucleotide sequence or a portion 
of the amino acid sequence and hence protein encoded thereby. Fragments of a 
nucleotide sequence may encode protein fragments that retain the biological 
activity of the native nucleic acid. Alternatively, fragments of a nucleotide 
sequence that are useful as hybridization probes generally do not encode 
fragment proteins retaining biological activity. Thus, fragments of a nucleotide 
sequence are generally greater than 20, 30, 50, 100, 150, 200 or 300 nucleotides 
and up to the entire nucleotide sequence encoding the proteins of the invention. 
Generally the probes are less than 1000 nucleotides and preferably less than 500 

nucleotides. Fragments of the invention include antisense sequences used to 
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decrease expression of the inventive polynucleotides. Such antisense fragments 
may vary in length ranging from at least about 20 nucleotides, about 50 
nucleotides, about 100 nucleotides, up to and including the entire coding 
sequence. 

By "functional equivalent" as applied to a polynucleotide or a protein is 
intended a polynucleotide or a protein of sufficient length to modulate the level of 
LEC1 protein activity in a plant cell. A polynucleotide functional equivalent can be 
in sense or antisense orientation. 

By "variants" is intended substantially similar sequences. Generally, 
nucleic acid sequence variants of the invention will have at least 60%, 65%, or 
70%, preferably 75%, 80% or 90%, more preferably at least 95% and most 
preferably at least 98% sequence identity to the native nucleotide sequence, 
wherein the % sequence identity is based on the entire sequence and is 
determined by GAP analysis using Gap Weight of 50 and Length Weight of 3. 
Generally, polypeptide sequence variants of the invention will have at least about 
50%, 55%, 60%, 65%, 70%, 75% or 80%, preferably at least about 85% or 90%, 
and more preferably at least about 95% sequence identity to the native protein, 
wherein the % sequence identity is based on the entire sequence and is 
determined by GAP analysis using Gap Weight of 12 and Length Weight of 4. 

A "responsive plant cell" or "responsive host cell" refers to a cell that 
exhibits a positive response to the introduction of LEC1 polypeptide or LEC1 
polynucleotide compared to a cell that has not been introduced with LEC1 
polypeptide or LEC1 polynucleotide. The response can be to enhance tissue 
culture response, induce somatic embryogenesis, induce apomixis, increase 
transformation efficiency or increase recovery of regenerated plants. 

A "recalcitrant plant cell" is a responsive plant cell that generally does not 
exhibit a positive response such as tissue culture response, transformation 
efficiency or recovery of regenerated plants. 

NUCLEIC ACIDS 

The present invention relates to a HAP3-type CCAAT-box binding 

transcriptional activators, and in particular, the leafy cotyledon 1 transcriptional 

activator (LEC1). Expression of the LEC1 polynucleotide initiates formation of 
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embryo-like structures and improves growth and recovery of transformants. The 
term apomixis is used to describe asexual reproduction that replaces or 
substitutes sexual methods of reproduction. When apomixis occurs, embryos are 
produced from maternal tissue and use only the maternal genome. 
5 In particular the present invention relates to an isolated nucleic acid 

comprising a member selected from the group consisting of: 

(a) a polynucleotide which encodes a polypeptide of SEQ ID NO: 2, 8, 
10, 12, 14, 16, 18,20, or 22; 

(b) a polynucleotide amplified from a plant nucleic acid library using the 
10 primers of SEQ ID NOS: 3 and 4, 5 and 6, 9 and 10, or 11 and 12 or 

primers determined by using Vector nti Suite, I nforMax Version 5. 

(c) a polynucleotide comprising at least 20 contiguous bases of SEQ ID 
NO: 1,7, 9, 11, 13,15, 17, 19 F or 21; 

(d) a polynucleotide encoding a plant HAP3-type ccaat-box transcriptional 
15 activator with the conserved motif of SEQ ID NO: 23, wherein the 

polynucleotide is from a plant other than Arabidopsis; 

(e) a polynucleotide having at least 60% sequence identity to SEQ ID 
NO: 1, 9, 11, 13, 17, or 21 or 65% sequence identity to SEQ ID NO: 
15 or 19 or 70% sequence identity to SEQ ID NO: 7, wherein the % 

20 sequence identity is based on the entire sequence and is determined 

by GAP analysis using Gap Weight of 50 and Length Weight of 3; 

(f) a polynucleotide comprising at least 25 nucleotides in length which 
hybridizes under high stringency conditions to a polynucleotide 
having the sequence set forth in SEQ ID NO: 1, 7, 9, 11, 13, 15, 17, 

25 19, or21; 

(g) a polynucleotide encoding the protein of SEQ ID NO: 2, 8, 10, 12, 
14, 16, 18, 20, or 22, wherein the polynucleotide is from a plant 
other than Arabidopsis] 

(h) a polynucleotide having the sequence set forth in SEQ ID NO: 1,7,9, 
30 11,13,15, 17,19, or21; and 

(i) a polynucleotide complementary to a polynucleotide of (a) through 
(h). 
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In many cases of apomixis maternal tissues such as the nucellus or inner 
integument "bud off' producing somatic embryos. These embryos then develop 
normally into seed. Since meiosis and fertilization are circumvented, the plants 
developing from such seed are genetically identical to the maternal plant. 
5 Expression of the leafy cotyledon 1 gene in the nucellus integument, or cell 

specific expression in the megaspore mother cell would trigger embryo formation 
from maternal tissues. 

Producing a seed identical to the parent has many advantages. For 
example high yielding hybrids could be used in seed production to multiply 
10 identical copies of high yielding hybrid seed. This would greatly reduce seed cost 
as well as increase the number of genotypes which are commercially available. 
Genes can be evaluated directly in commercial hybrids since the progeny would 
not segregate. This would save years of back crossing. 

Apomixis would also provide a method of containment of transgenes when 
15 coupled with male sterility. The construction of male sterile autonomous 

agamospermy would prevent genetically engineered traits from hybridizing with 
weedy relatives. 

Gene stacking would be relatively easy with apomixis. Hybrids could be 
successively re-transformed with various new traits and propagated via apomixis. 

20 The traits would not need to be linked since apomixis avoids the problems 
associated with segregation. 

Apomixis can provide a reduction in gene silencing. Gene silencing is 
frequently seen following meiotic divisions. Since meiotic divisions never occur, it 
may be possible to eliminate or reduce the frequency of gene silencing. Apomixis 

25 can also be used stabilize desirable phenotypes with complex traits such as 

hybrid vigor. Such traits could easily be maintained and multiplied indefinitely via 
apomixis. 

The Cauliflower Mosaic Virus 35S promoter has been used to overexpress 
LEC1 during Agrobacterium-mediated in planta transformation of Arabidopsis 
30 (Harada et ai, WO 98/37184). As pointed out by Harada et al, 35S is a strong 
promoter, and in their experiments it was found that 35S:LEC1 did not improve 
transformation and actually appeared to hinder it (transformation efficiency with 
35S:LEC1 was estimated to be 0.6% of that obtained normally). Thus, 



overexpression in a cell type such as those in the gametophytic stage of 
development may be inappropriate and detrimental to the transformation process 
and successful recovery of transformed progeny. In contrast, we have shown that 
ectopic expression of the LEC1 gene under the appropriate control elements 
(including tissue specific and/or inducible promoters) and in the appropriate plant 
cells can be used to stimulate embryo formation in tissues/genotypes normally not 
amenable to culture. Likewise ectopic expression in genotypes amenable to 
culture can increase the number of embryo precursor cells (or increase the 
number that develop into embryos) leading to an increase in transformation 
frequency. Transient expression using RNA or protein may be sufficient to initiate 
the cascade of events leading to embryo formation. This would be valuable in 
such target tissues as maize scutella, immature leaf bases, immature tassels, etc. 
The LEC1 gene could be used as a positive selectable marker, i.e. triggering 
embryogenesis in transgenic cells without killing the surrounding wild-type cells. 
This would happen since the cells receiving the introduced gene would undergo 
embryogenesis or in tissues already undergoing embryogenesis LEC1 expression 
would stimulate more rapid reiteration of somatic embryos. 

It has been shown through sequence similarity that the Arabidopsis LEC1 
polypeptide is homologous to the HAP3 subunit of the "CCAAT-box binding factor" 
class of eukaryotic transcriptional activators (Lotan era/., 1998, Cell 93:1195- 
1205). This class of proteins, which consist of Hap2/3/4 and 5, form a 
heteroligomeric transcriptional complex, that appears to activate specific gene 
sets in eukaryotes. Certain members of this family such as Hap2 and Hap5 
appear to be ubiquitously expressed, while different Hap3 members are under 
developmental or environmental regulation. Plant HAP3 polypeptides can be 
recognized by a high degree of sequence identity to other HAP3 homologs in the 
"B domain" of the protein. For example, the B domain for the Arabidopsis LEC1 , 
from amino acid residue 28 to residue 117, shares between 55% and 63% identity 
(75-85% similarity) to other members of the HAP3 family, including maize (HAP3), 
chicken, lamprey, Xenopus, human, mouse, Emericella nidulens, 
Schizosaccharomyces pombe, Saccharomyces cerevisiae and Kluuyveromyces 
lactis (Lotan et ai, 1 998). 



Expression of the LEC1 gene in transformed cells initiates embryo 
development and stimulates development of pre-existing embryos. Normally, 
LEC1 expression is necessary for proper embryo maturation in the latter stages of 
embryo development, and LEC1 transgene expression thus may also promote 
these processes. The combined effect of these impacts on somatic 
embryogenesis is not only to stimulate growth of transformed cells, but also to 
insure that transformed somatic embryos develop in a normal, viable fashion 
(increasing the capacity of transformed somatic embryos to germinate vigorously). 
Continued ectopic overexpression beyond embryo maturation may negatively 
impact germination and vegetative plant growth (which may necessitate down- 
regulation of the LEC1 transgene during these stages of development. 

Expression of the LEC1 gene will stimulate growth in cells with the potential 
to initiate or maintain embryogenic growth. Cells in established meristems or 
meristem-derive cell lineages may be less prone to undergo the transition to 
embryos. In addition, transformation methods that target certain reproductive 
tissues (or cells) such as vacuum-infiltration of Agrobacterium into Arabidopsis 
may have detrimental effects on recovery of transformants (triggering genes 
associated with embryogenesis may disrupt the proper functioning of these cells). 

The polypeptides encoded by the present plant LEC1 genes can be 
distinguished from non-LEC HAP3 proteins by using the diagnostic motif shown in 
SEQ ID NO: 23. 

The isolated nucleic acids of the present invention can be made using (a) 
standard recombinant methods, (b) synthetic techniques, or combinations thereof. 
In some embodiments, the polynucleotides of the present invention will be cloned, 
amplified, or otherwise constructed from a monocot or dicot. In preferred 
embodiments the monocot is corn, sorghum, barley, wheat, millet, or rice. 
Preferred dicots include soybeans, sunflower, canola, alfalfa, potato, or cassava. 

Functional fragments included in the invention can be obtained using 

primers which selectively hybridize under stringent conditions. Primers are 

generally at least 12 bases in length and can be as high as 200 bases, but will 

generally be from 15 to 75, preferably from 15 to 50. Functional fragments can be 

identified using a variety of techniques such as restriction analysis, Southern 

analysis, primer extension analysis, and DNA sequence analysis. 
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The present invention includes a plurality of polynucleotides that encode for 
the identical amino acid sequence. The degeneracy of the genetic code allows for 
such "silent variations" which can be used, for example, to selectively hybridize 
and detect allelic variants of polynucleotides of the present invention. 
Additionally, the present invention includes isolated nucleic acids comprising 
allelic variants. The term "allele" as used herein refers to a related nucleic acid of 
the same gene. 

Variants of nucleic acids included in the invention can be obtained, for 

example, by oligonucleotide-directed mutagenesis, linker-scanning mutagenesis, 

mutagenesis using the polymerase chain reaction, and the like. See, for example, 

Ausubel, pages 8.0.3 - 8.5.9. Also, see generally, McPherson (ed.). DIRECTED 

MUTAGENESIS: A Practical Approach, (IRL Press, 1991). Thus, the present 

invention also encompasses DNA molecules comprising nucleotide sequences 

that have substantial sequence similarity with the inventive sequences. 

Variants included in the invention may contain individual substitutions, 

deletions or additions to the nucleic acid or polypeptide sequences which alters, 

adds or deletes a single amino acid or a small percentage of amino acids in the 

encoded sequence is a "conservatively modified variant" where the alteration 

results in the substitution of an amino acid with a chemically similar amino acid. 

When the nucleic acid is prepared or altered synthetically, advantage can be 

taken of known codon preferences of the intended host. 

The present invention also includes "shufflents" produced by sequence 

shuffling of the inventive polynucleotides to obtain a desired characteristic. 

Sequence shuffling is described in PCT publication No. 96/19256. See also, 

Zhang, J. H., etal., Proc. Natl. Acad. Sci. USA 94:4504-4509 (1997). 

The present invention also includes the use of 5' and/or 3' UTR regions for 

modulation of translation of heterologous coding sequences. Positive sequence 

motifs include translational initiation consensus sequences (Kozak, Nucleic Acids 

Res. 15:8125 (1987)) and the 7-methylguanosine cap structure (Drummond et al., 

Nucleic Acids Res. 13:7375 (1985)). Negative elements include stable 

intramolecular 5' UTR stem-loop structures (Muesing etal., Cell 48:691 (1987)) 

and AUG sequences or short open reading frames preceded by an appropriate 

AUG in the 5' UTR (Kozak, supra, Rao etal., Mol. and Cell. Biol. 8:284 (1988)). 
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Further, the poiypeptide-encoding segments of the polynucleotides of the 
present invention can be modified to alter codon usage. Altered codon usage can 
be employed to alter translational efficiency. Codon usage in the coding regions 
of the polynucleotides of the present invention can be analyzed statistically using 
5 commercially available software packages such as "Codon Preference" available 
from the University of Wisconsin Genetics Computer Group (see Devereaux et al., 
Nucleic Acids Res. 12:387-395 (1984)) or MacVector 4.1 (Eastman Kodak Co., 
New Haven, Conn.). 

For example, the inventive nucleic acids can be optimized for enhanced 

10 expression in plants of interest. See, for example, EPA0359472; W091/16432; 
Perlak etal. (1991) Proc. Natl. Acad Sci. USA 88:3324-3328; and Murray etai 
(1989) Nucleic Acids Res. 77:477-498. In this manner, the polynucleotides can 
be synthesized utilizing plant-preferred codons. See, for example, Murray et al. 
(1989) Nucleic Acids Res. 77:477-498, the disclosure of which is incorporated 

15 herein by reference. 

The present invention provides subsequences comprising isolated nucleic 
acids containing at least 16 contiguous bases of the inventive sequences. For 
example the isolated nucleic acid includes those comprising at least 16, 20, 25, 
30, 40, 50, 60, 75 or 100 contiguous nucleotides of the inventive sequences. 

20 Subsequences of the isolated nucleic acid can be used to modulate or detect 
gene expression by introducing into the subsequences compounds which bind, 
intercalate, cleave and/or crosslink to nucleic acids. 

The nucleic acids of the invention may conveniently comprise a multi- 
cloning site comprising one or more endonuclease restriction sites inserted into 

25 the nucleic acid to aid in isolation of the polynucleotide. Also, translatable 

sequences may be inserted to aid in the isolation of the translated polynucleotide 
of the present invention. For example, a hexa-histidine marker sequence 
provides a convenient means to purify the proteins of the present invention. 
A polynucleotide of the present invention can be attached to a vector, 

30 adapter, promoter, transit peptide or linker for cloning and/or expression of a 

polynucleotide of the present invention. Additional sequences may be added to 

such cloning and/or expression sequences to optimize their function in cloning 

and/or expression, to aid in isolation of the polynucleotide, or to improve the 
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introduction of the polynucleotide into a cell. Use of cloning vectors, expression 
vectors, adapters, and linkers is well known and extensively described in the art. 
For a description of such nucleic acids see, for example, Stratagene Cloning 
Systems, Catalogs 1995, 1996, 1997 (La Jolla, CA); and, Amersham Life 
5 Sciences, Inc, Catalog '97 (Arlington Heights, IL). 

The isolated nucleic acid compositions of this invention, such as RNA, 
cDNA, genomic DNA, or a hybrid thereof, can be obtained from plant biological 
sources using any number of cloning methodologies known to those of skill in the 
art. In some embodiments, oligonucleotide probes which selectively hybridize, 

10 under stringent conditions, to the polynucleotides of the present invention are 
used to identify the desired sequence in a cDNA or genomic DNA library. 

Exemplary total RNA and mRNA isolation protocols are described in Plant 
Molecular Biology: A Laboratory Manual, Clark, Ed., Springer-Verlag, Berlin 
(1997); and, Current Protocols in Molecular Biology, Ausubel, et a/., Eds., Greene 

15 Publishing and Wiley-lnterscience, New York (1995). Total RNA and mRNA 
isolation kits are commercially available from vendors such as Stratagene (La 
Jolla, CA), Clonetech (Palo Alto, CA), Pharmacia (Piscataway, NJ), and 5'-3' 
(Paoli, PA). See also, U.S. Patent Nos. 5,614,391; and, 5,459,253. 

Typical cDNA synthesis protocols are well known to the skilled artisan and 

20 are described in such standard references as: Plant Molecular Biology: A 
Laboratory Manual, Clark, Ed., Springer-Verlag, Berlin (1997); and, Current 
Protocols in Molecular Biology, Ausubel, et a/., Eds., Greene Publishing and 
Wiley-lnterscience, New York (1995). cDNA synthesis kits are available from a 
variety of commercial vendors such as Stratagene or Pharmacia. 

25 An exemplary method of constructing a greater than 95% pure full-length 

cDNA library is described by Carninci et ai, Genomics, 37:327-336 (1996). Other 
methods for producing full-length libraries are known in the art. See, e.g., Edery 
etal., Mol. Cell Biol. 15(6):3363-3371 (1995); and PCT Application WO 96/34981. 
It is often convenient to normalize a cDNA library to create a library in 

30 which each clone is more equally represented. A number of approaches to 
normalize cDNA libraries are known in the art. Construction of normalized 
libraries is described in Ko, NucL Acids. Res. 18(19):5705-5711 (1990); Patanjali 
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etal., Proc. Natl. Acad. U.S.A. 88:1943-1947 (1991); U.S. Patents 5,482,685 and 
5,637,685; and Soares etai, Proc. Natl. Acad. Sci. USA 91:9228-9232 (1994). 

Subtracted cDNA libraries are another means to increase the proportion of 
less abundant cDNA species. See, Foote et al. in, Plant Molecular Biology: A 
5 Laboratory Manual, Clark, Ed., Springer- Verlag, Berlin (1997); Kho and Zarbl, 
Technique 3(2):58-63 (1991); Sive and St. John, Nucl. Acids Res. 16(22):10937 
(1988); Current Protocols in Molecular Biology, Ausubel, et al., Eds., Greene 
Publishing and Wiley-lnterscience, New York (1995); and, Swaroop etai, Nucl. 
Acids Res. 19(8): 1954 (1991). cDNA subtraction kits are commercially available. 

10 See, e.g., PCR-Select (Clontech). 

To construct genomic libraries, large segments of genomic DNA are 
generated by random fragmentation. Examples of appropriate molecular 
biological techniques and instructions are found in Sambrook, et al., Molecular 
Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory, Vols. 1-3 

15 (1989), Methods in Enzymology, Vol. 152: Guide to Molecular Cloning 

Techniques, Berger and Kimmel, Eds., San Diego: Academic Press, Inc. (1987), 
Current Protocols in Molecular Biology, Ausubel, et a/., Eds., Greene Publishing 
and Wiley-lnterscience, New York (1995); Plant Molecular Biology: A Laboratory 
Manual, Clark, Ed., Springer-Verlag, Berlin (1997). Kits for construction of 

20 genomic libraries are also commercially available. 

The cDNA or genomic library can be screened using a probe based upon 
the sequence of a nucleic acid of the present invention such as those disclosed 
herein. Probes may be used to hybridize with genomic DNA or cDNA sequences 
to isolate homologous polynucleotides in the same or different plant species. 

25 Those of skill in the art will appreciate that various degrees of stringency of 
hybridization can be employed in the assay; and either the hybridization or the 
wash medium can be stringent. The degree of stringency can be controlled by 
temperature, ionic strength, pH and the presence of a partially denaturing solvent 
such as formamide. 

30 Typically, stringent hybridization conditions will be those in which the salt 

concentration is less than about 1.5 M Na ion, typically about 0.01 to 1 .0 M Na ion 

concentration (or other salts) at pH 7.0 to 8.3 and the temperature is at least 

about 30°C for short probes (e.g., 10 to 50 nucleotides) and at least about 60°C 
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for long probes (e.g., greater than 50 nucleotides). Stringent conditions may also 
be achieved with the addition of destabilizing agents such as formamide. 

For purposes of defining the invention the following conditions are 
provided. Exemplary low stringency conditions include hybridization with a buffer 
5 solution of 30 to 35% formamide, 1 M NaCI, 1% SDS (sodium dodecyl sulfate) at 
37°C, and a wash in 1X to 2X SSC (20X SSC = 3.0 M NaCI/0.3 M trisodium 
citrate) at 50°C. Exemplary moderate stringency conditions include hybridization 
in 40 to 45% formamide, 1 M NaCI, 1% SDS at 37°C, and a wash in 0.5X to 1X 
SSC at 55°C. Exemplary high stringency conditions include hybridization in 50% 

10 formamide, 1 M NaCI, 1% SDS at 37°C, and a wash in 0.1 X SSC at 60°C. 
Typically the time of hybridization is from 4 to 16 hours. 

An extensive guide to the hybridization of nucleic acids is found in Tijssen, 
Laboratory Techniques in Biochemistry and Molecular Biology—Hybridization with 
Nucleic Acid Probes, Part I, Chapter 2 "Overview of principles of hybridization and 

1 5 the strategy of nucleic acid probe assays", Elsevier, New York (1 993); and 

Current Protocols in Molecular Biology, Chapter 2, Ausubel, etai, Eds., Greene 
Publishing and Wiley-lnterscience, New York (1995). Often, cDNA libraries will be 
normalized to increase the representation of relatively rare cDNAs. 

The nucleic acids of the invention can be amplified from nucleic acid 

20 samples using amplification techniques. For instance, polymerase chain reaction 
(PCR) technology can be used to amplify the sequences of polynucleotides of the 
present invention and related polynucleotides directly from genomic DNA or cDNA 
libraries. PCR and other in vitro amplification methods may also be useful, for 
example, to clone nucleic acid sequences that code for proteins to be expressed, 

25 to make nucleic acids to use as probes for detecting the presence of the desired 
mRNA in samples, for nucleic acid sequencing, or for other purposes. 

Examples of techniques useful for//? vitro amplification methods are found 
in Berger, Sambrook, and Ausubel, as well as Mullis etai, U.S. Patent No. 
4,683,202 (1987); and, PCR Protocols A Guide to Methods and Applications, Innis 

30 etai, Eds., Academic Press Inc., San Diego, CA (1990). Commercially available 

kits for genomic PCR amplification are known in the art. See, e.g., Advantage-GC 

Genomic PCR Kit (Clontech). The T4 gene 32 protein (Boehringer Mannheim) 

can be used to improve yield of long PCR products. PCR-based screening 
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methods have also been described. Wilfinger etai describe a PCR-based 
method in which the longest cDNA is identified in the first step so that incomplete 
clones can be eliminated from study. BioTechniques, 22(3):481-486 (1997). 

In one aspect of the invention, nucleic acids can be amplified from a plant 
5 nucleic acid library. The nucleic acid library may be a cDNA library, a genomic 
library, or a library generally constructed from nuclear transcripts at any stage of 
intron processing. Libraries can be made from a variety of plant tissues. Good 
results have been obtained using mitotically active tissues such as shoot 
meristems, shoot meristem cultures, embryos, callus and suspension cultures, 
10 immature ears and tassels, and young seedlings. The cDNAs of the present 
invention were obtained from immature zygotic embryo and regenerating callus 
libraries. 

Alternatively, the sequences of the invention can be used to isolate 
corresponding sequences in other organisms, particularly other plants, more 

15 particularly, other monocots. In this manner, methods such as PCR, hybridization, 
and the like can be used to identify such sequences having substantial sequence 
similarity to the sequences of the invention. See, for example, Sambrook et al 
(1989) Molecular Cloning: A Laboratory Manual (2d ed., Cold Spring Harbor 
Laboratory Press, Plainview, New York), and Innis et al. (1990), PCR Protocols: A 

20 Guide to Methods and Applications (Academic Press, New York). Coding 

sequences isolated based on their sequence identity to the entire inventive coding 
sequences set forth herein or to fragments thereof are encompassed by the 
present invention. 

The isolated nucleic acids of the present invention can also be prepared by 

25 direct chemical synthesis by methods such as the phosphotriester method of 

Narang et al, Meth. Enzymol. 68:90-99 (1979); the phosphodiester method of 

Brown et al., Meth. Enzymol 68:109-151 (1979); the diethylphosphoramidite 

method of Beaucage et al, Tetra. Lett 22:1859-1862 (1981); the solid phase 

phosphoramidite triester method described by Beaucage and Caruthers, Tetra. 

30 Letts. 22(20): 1859-1 862 (1981), e.g., using an automated synthesizer, e.g., as 

described in Needham-VanDevanter et al, Nucleic Acids Res. 12:6159-6168 

(1984); and, the solid support method of U.S. Patent No. 4,458,066. Chemical 

synthesis generally produces a single stranded oligonucleotide. This may be 
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converted into double stranded DNA by hybridization with a complementary 
sequence, or by polymerization with a DNA polymerase using the single strand as 
a template. One of skill will recognize that while chemical synthesis of DNA is 
limited to sequences of about 100 bases, longer sequences may be obtained by 
5 the ligation of shorter sequences. 

The nucleic acids of the present invention include those amplified using the 
following primer pairs: SEQ ID NOS: 3-6 and 9-12. 

EXPRESSION CASSETTES 

10 In another embodiment expression cassettes comprising isolated nucleic 

acids of the present invention are provided. An expression cassette will typically 
comprise a polynucleotide of the present invention operably linked to 
transcriptional initiation regulatory sequences which will direct the transcription of 
the polynucleotide in the intended host cell, such as tissues of a transformed 

15 plant 

The construction of such expression cassettes which can be employed in 
conjunction with the present invention is well known to those of skill in the art in 
light of the present disclosure. See, e.g., Sambrook, et a/.; Molecular Cloning: A 
Laboratory Manual] Cold Spring Harbor, New York; (1989); Gelvin, etal.\ Plant 
20 Molecular Biology Manual (1 990); Plant Biotechnology: Commercial Prospects 
and Problems, eds. Prakash, etai\ Oxford & IBH Publishing Co.; New Delhi, 
India; (1993); and Heslot, etai\ Molecular Biology and Genetic Engineering of 
Yeasts; CRC Press, Inc., USA; (1992); each incorporated herein in its entirety by 
reference. 

25 For example, plant expression vectors may include (1) a cloned plant gene 

under the transcriptional control of 5* and 3' regulatory sequences and (2) a 
dominant selectable marker. Such plant expression vectors may also contain, if 
desired, a promoter regulatory region (e.g., one conferring inducible, constitutive, 
environmentally- or developmentally-regulated, or cell- or tissue-specific/selective 

30 expression), a transcription initiation start site, a ribosome binding site, an RNA 

processing signal, a transcription termination site, and/or a polyadenylation signal. 

Constitutive, tissue-preferred or inducible promoters can be employed. 

Examples of constitutive promoters include the cauliflower mosaic virus (CaMV) 
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35S transcription initiation region, the 1- or 2'- promoter derived from T-DNA of 
Agrobacterium tumefaciens, the actin promoter, the ubiquitin promoter, the 
histone H2B promoter (Nakayama et a/., 1992, FEBS Lett 30:167-170), the Smas 
promoter, the cinnamyl alcohol dehydrogenase promoter (U.S. Patent No. 
5 5,683,439), the Nos promoter, the pEmu promoter, the rubisco promoter, the 
GRP1-8 promoter, and other transcription initiation regions from various plant 
genes known in the art. 

Examples of inducible promoters are the Adh1 promoter which is inducible 
by hypoxia or cold stress, the Hsp70 promoter which is inducible by heat stress, 

10 the PPDK promoter which is inducible by light, the In2 promoter which is safener 
induced, the ERE promoter which is estrogen induced and the Pepcarboxylase 
promoter which is light induced. 

Examples of promoters under developmental control include promoters that 
initiate transcription preferentially in certain tissues, such as leaves, roots, fruit, 

15 seeds, or flowers. An exemplary promoter is the anther specific promoter 5126 
(U.S. Patent Nos. 5,689,049 and 5,689,051). Examples of seed-preferred 
promoters include, but are not limited to, 27 kD gamma zein promoter and waxy 
promoter, Boronat, A., Martinez, M.C., Reina, M., Puigdomenech, P. and Palau, 
J.; Isolation and sequencing of a 28 kD glutelin-2 gene from maize: Common 

20 elements in the 5' flanking regions among zein and glutelin genes; Plant Sc/. 

47:95-102 (1986) and Reina, M., Ponte, I., Guillen, P., Boronat, A. and Palau, J., 
Sequence analysis of a genomic clone encoding a Zc2 protein from Zea mays 
W64 A, Nucleic Acids Res. 18(21):6426 (1990). See the following site relating to 
the waxy promoter: Kloesgen, R.B., Gierl,A., Schwarz-Sommer, Z.S. and Saedler, 

25 H., Molecular analysis of the waxy locus of Zea mays, MoL Gen. Genet 203:237- 
244 (1986). The disclosures each of these are incorporated herein by reference 
in their entirety. 

Preferably a weak constitutive promoter, such as the Nos promoter, an 

inducible promoter, such as In2, or a nucellus-preferred or integument-preferred 

30 promoter are used to induce apospory. For example the barley or maize Nuc1 

promoter, the maize Cim 1 promoter or the maize LTP2 promoter can be used to 

preferentially express in the nucellus. See for example US Serial No. 60/097,233 

filed August 20, 1998 the disclosure of which is incorporated herein by reference. 
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Either heterologous or non-heterologous (i.e., endogenous) promoters can 
be employed to direct expression of the nucleic acids of the present invention. 
These promoters can also be used, for example, in expression cassettes to drive 
expression of antisense nucleic acids to reduce, increase, or alter concentration 
5 and/or composition of the proteins of the present invention in a desired tissue. 

If polypeptide expression is desired, it is generally desirable to include a 
polyadenylation region at the 3'-end of a polynucleotide coding region. The 
polyadenylation region can be derived from the natural gene, from a variety of 
other plant genes, or from T-DNA. The 3' end sequence to be added can be 
10 derived from, for example, the nopaline synthase or octopine synthase genes, or 
alternatively from another plant gene, or less preferably from any other eukaryotic 
gene. 

An intron sequence can be added to the 5' untranslated region or the 
coding sequence of the partial coding sequence to increase the amount of the 

15 mature message that accumulates. See for example Buchman and Berg, Mol. 
Cell Biol. 8:4395-4405 (1988); Callis et a/ M Genes Dev. 1:1183-1200 (1987). Use 
of maize introns Adh1-S intron 1, 2, and 6, the Bronze-1 intron are known in the 
art. See generally, The Maize Handbook, Chapter 116, Freeling and Walbot, 
Eds., Springer, New York (1994). 

20 The vector comprising the sequences from a polynucleotide of the present 

invention will typically comprise a marker gene which confers a selectable 
phenotype on plant cells. Usually, the selectable marker gene will encode 
antibiotic or herbicide resistance. Suitable genes include those coding for 
resistance to the antibiotics spectinomycin and streptomycin (e.g., the aada 

25 gene), the streptomycin phosphotransferase (SPT) gene coding for streptomycin 
resistance, the neomycin phosphotransferase (NPTII) gene encoding kanamycin 
orgeneticin resistance, the hygromycin phosphotransferase (HPT) gene coding 
for hygromycin resistance. 

Suitable genes coding for resistance to herbicides include those which act 

30 to inhibit the action of acetolactate synthase (ALS), in particular the 

sulfonylurea-type herbicides (e.g., the acetolactate synthase (ALS) gene 
containing mutations leading to such resistance in particular the S4 and/or Hra 
mutations), those which act to inhibit action of glutamine synthase, such as 



phosphinothricin or basta (e.g., the bar gene), or other such genes known in the 
art. The bar gene encodes resistance to the herbicide basta and the ALS gene 
encodes resistance to the herbicide chlorsulfuron. While useful in conjunction 
with the above antibiotic and herbicide-resistance selective markers (i.e. use of 
5 the LEC1 gene can increase transformation frequencies when using chemical 
selection), a prefered use of LEC1 expression takes advantage of this gene 
conferring a growth advantage to transformed cells without the need for inhibitory 
compounds to retard non-transformed growth. Thus, LEC1 transformants are 
recovered based solely on their differential growth advantage. 

10 Typical vectors useful for expression of genes in higher plants are well 

known in the art and include vectors derived from the tumor-inducing (Ti) plasmid 
of Agrobacterium tumefaciens described by Rogers et a/., Meth. In Enzymol. 
153:253-277 (1987). Exemplary A. tumefaciens vectors useful herein are 
plasmids pKYLX6 and pKYLX7 of Schardl et a/ M Gene, 61:1-11 (1987) and Berger 

15 et a/., Proc. Natl. Acad Sci. USA 86:8402-8406 (1989). Another useful vector 

herein is plasmid pBI101.2 that is available from Clontech Laboratories, Inc. (Palo 
Alto, CA). 

A variety of plant viruses that can be employed as vectors are known in the 
art and include cauliflower mosaic virus (CaMV), geminivirus, brome mosaic virus, 
20 and tobacco mosaic virus. 

A polynucleotide of the present invention can be expressed in either sense 
or anti-sense orientation as desired. In plant cells, it has been shown that 
antisense RNA inhibits gene expression by preventing the accumulation of mRNA 
which encodes the enzyme of interest, see, e.g., Sheehy et a/., Proc. Natl. Acad. 
25 Sci. USA 85:8805-8809 (1988); and Hiatt et a/., U.S. Patent No. 4,801,340. 

Another method of suppression is sense suppression. Introduction of 
nucleic acid configured in the sense orientation has been shown to be an effective 
means by which to block the transcription of target genes. For an example of the 
use of this method to modulate expression of endogenous genes see, Napoli et 
30 a/., The Plant Cell 2:279-289 (1990) and U.S. Patent No. 5,034,323. 

Recent work has shown suppression with the use of double stranded RNA. 
Such work is described in Tabara et a/., Science 282:5388:430-431 (1998). 



-20- 



Catalytic RNA molecules or ribozymes can also be used to inhibit 
expression of plant genes. The inclusion of ribozyme sequences within antisense 
RNAs confers RNA-cleaving activity upon them, thereby increasing the activity of 
the constructs. The design and use of target RNA-specific ribozymes is described 
5 in Haseloff et a/., Nature 334:585-591 (1988). 

A variety of cross-linking agents, alkylating agents and radical generating 
species as pendant groups on polynucleotides of the present invention can be 
used to bind, label, detect, and/or cleave nucleic acids. For example, Vlassov, V. 
V., et al., Nucleic Acids Res (1986) 14:4065-4076, describe covalent bonding of a 

10 single-stranded DNA fragment with alkylating derivatives of nucleotides 

complementary to target sequences. A report of similar work by the same group 
is that by Knorre, D. G., et al., Biochimie (1985) 67:785-789. Iverson and Dervan 
also showed sequence-specific cleavage of single-stranded DNA mediated by 
incorporation of a modified nucleotide which was capable of activating cleavage 

15 (J. Am. Chem. Soc. (1987) 109:1241-1243). Meyer, R. B., etal., J. Am. Chem. 
Soc. (1989) 111:8517-8519, effect covalent crosslinking to a target nucleotide 
using an alkylating agent complementary to the single-stranded target nucleotide 
sequence. A photoactivated crosslinking to single-stranded oligonucleotides 
mediated by psoralen was disclosed by Lee, B. L, etal., Biochemistry (1988) 

20 27:3197-3203. Use of crosslinking in triple-helix forming probes was also 

disclosed by Home, et al., J. Am. Chem. Soc. (1990) 1 12:2435-2437. Use of N4, 
N4-ethanocytosine as an alkylating agent to crosslink to single-stranded 
oligonucleotides has also been described by Webb and Matteucci, J. Am. Chem. 
Soc. (1986) 108:2764-2765; Nucleic Acids Res (1986) 14:7661-7674; Feteritz et 

25 al., J. Am, Chem. Soc. 1 13:4000 (1991). Various compounds to bind, detect, 
label, and/or cleave nucleic acids are known in the art. See, for example, U.S. 
Patent Nos. 5,543,507; 5,672,593; 5,484,908; 5,256,648; and, 5,681941. 



Proteins 

30 In another aspect, the invention relates to an isolated protein comprising a 

member selected from the group consisting of: 

(a) a polypeptide comprising at least 25 contiguous amino acids of SEQ 

ID NO: 2, 8, 10, 12, 14, 16, 18, 20, or 22; 
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(b) a polypeptide which is a plant HAP3-type CCATT-box binding 
transcriptional activator that regulates gene expression during 
embryo development and maturation; 

(c) a polypeptide comprising at least 60% sequence identity to SEQ ID 

5 NO: 2, 12, 14, 16, 20, or 22, or 70 % sequence identity to SEQ ID NO: 

8, 1 0, or 18, wherein the % sequence identity is based on the entire 
sequence and is determined by GAP analysis using Gap Weight of 12 
and Length Weight of 4; 

(d) a polypeptide encoded by a nucleic acid of claim 1 ; 

10 (e) a polypeptide encoded by a nucleic acid of SEQ ID NO: 1 , 7, 9, 1 1 , 

13,15,17,19, or21;and 
(f) a polypeptide having the sequence set forth in SEQ ID NO: 2, 8, 10, 
12, 14, 16, 18, 20, or 22. 



15 Proteins of the present invention include proteins derived from the native 

protein by deletion (so-called truncation), addition or substitution of one or more 
amino acids at one or more sites in the native protein. Such variants may result 
from, for example, genetic polymorphism or from human manipulation. Methods 
for such manipulations are generally known in the art. 

20 For example, amino acid sequence variants of the polypeptide can be 

prepared by mutations in the cloned DNA sequence encoding the native protein of 
interest. Methods for mutagenesis and nucleotide sequence alterations are well 
known in the art. See, for example, Walker and Gaastra, eds. (1983) Techniques 
in Molecular Biology (MacMillan Publishing Company, New York); Kunkel (1985) 

25 Proc. Natl. Acad. Sci USA 82:488-492; Kunkel etai (1987) Methods Enzymol. 
154:367-382; Sambrook etai (1989) Molecular Cloning: A Laboratory Manual 
(Cold Spring Harbor, New York); U.S. Patent No. 4,873,192; and the references 
cited therein; herein incorporated by reference. Guidance as to appropriate amino 
acid substitutions that do not affect biological activity of the protein of interest may 

30 be found in the model of Dayhoff et ai (1 978) Atlas of Protein Sequence and 

Structure (Natl. Biomed. Res. Found., Washington, D.C.), herein incorporated by 

reference. Conservative substitutions, such as exchanging one amino acid with 

another having similar properties, may be preferred. 

-22- 



In constructing variants of the proteins of interest, modifications to the 
nucleotide sequences encoding the variants will be made such that variants 
continue to possess the desired activity. Obviously, any mutations made in the 
DNA encoding the variant protein must not place the sequence out of reading 
5 frame and preferably will not create complementary regions that could produce 
secondary mRNA structure. See EP Patent Application Publication No. 75,444. 

The isolated proteins of the present invention include a polypeptide 
comprising at least 23 contiguous amino acids encoded by any one of the nucleic 
acids of the present invention, or polypeptides which are conservatively modified 

1 0 variants thereof. The proteins of the present invention or variants thereof can 

comprise any number of contiguous amino acid residues from a polypeptide of the 
present invention, wherein that number is selected from the group of integers 
consisting of from 23 to the number of residues in a full-length polypeptide of the 
present invention. Optionally, this subsequence of contiguous amino acids is at 

15 least 25, 30, 35, or 40 amino acids in length, often at least 50, 60, 70, 80, or 90 
amino acids in length. 

The present invention includes catalytically active polypeptides (i.e., 
enzymes). Catalytically active polypeptides will generally have a specific activity 
of at least 20%, 30%, or 40%, and preferably at least 50%, 60%, or 70%, and 

20 most preferably at least 80%, 90%, or 95% that of the native (non-synthetic), 
endogenous polypeptide. Further, the substrate specificity (k^/KJ is optionally 
substantially similar to the native (non-synthetic), endogenous polypeptide. 
Typically, the K m will be at least 30%, 40%, or 50%, that of the native (non- 
synthetic), endogenous polypeptide; and more preferably at least 60%, 70%, 

25 80%, or 90%. Methods of assaying and quantifying measures of enzymatic 

activity and substrate specificity (k^/K^, are well known to those of skill in the art. 

The present invention includes modifications that can be made to an 
inventive protein. In particular, it may be desirable to diminish the activity of the 
LEC1 gene. Other modifications may be made to facilitate the cloning, 

30 expression, or incorporation of the targeting molecule into a fusion protein. Such 
modifications are well known to those of skill in the art and include, for example, a 
methionine added at the amino terminus to provide an initiation site, or additional 
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amino acids (e.g., poly His) placed on either terminus to create conveniently 
located restriction sites or termination codons or purification sequences. 

Using the nucleic acids of the present invention, one may express a protein 
of the present invention in a recombinantly engineered cell such as bacteria, 
5 yeast, insect, mammalian, or preferably plant cells. The cells produce the protein 
in a non-natural condition (e.g., in quantity, composition, location, and/or time), 
because they have been genetically altered through human intervention to do so. 

Typically, an intermediate host cell will be used in the practice of this 
invention to increase the copy number of the cloning vector. With an increased 
10 copy number, the vector containing the gene of interest can be isolated in 
significant quantities for introduction into the desired plant cells. 

Host cells that can be used in the practice of this invention include 
prokaryotes, including bacterial hosts such as Eschericia co//, Salmonella 
typhimurium, and Serratia marcescens. Eukaryotic hosts such as yeast or 
15 filamentous fungi may also be used in this invention. Since these hosts are also 
microorganisms, it will be essential to ensure that plant promoters which do not 
cause expression of the polypeptide in bacteria are used in the vector. 

Commonly used prokaryotic control sequences include such commonly 
used promoters as the beta lactamase (penicillinase) and lactose (lac) promoter 
20 systems (Chang et a/., Nature 198:1056 (1977)), the tryptophan (trp) promoter 
system (Goeddel et a/., Nucleic Acids Res. 8:4057 (1980)) and the lambda 
derived P L promoter and N-gene ribosome binding site (Shimatake et a/., Nature 
292:128 (1981)). The inclusion of selection markers in DNA vectors transfected in 

E. coli is also useful. Examples of such markers include genes specifying 
25 resistance to ampicillin, tetracycline, or chloramphenicol. 

The vector is selected to allow introduction into the appropriate host cell. 
Bacterial vectors are typically of plasmid or phage origin. Expression systems for 
expressing a protein of the present invention are available using Bacillus sp. and 
Salmonella (Palva, et a/., Gene 22:229-235 (1983); Mosbach, et a/., Nature 
30 302:543-545 (1983)). 

Synthesis of heterologous proteins in yeast is well known. See Sherman, 

F. , et a/., Methods in Yeast Genetics, Cold Spring Harbor Laboratory (1982). Two 

widely utilized yeast for production of eukaryotic proteins are Saccharomyces 
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cerevisiae and Pichia pastoris. Vectors, strains, and protocols for expression in 
Saccharomyces and Pichia are known in the art and available from commercial 
suppliers (e.g., Invitrogen). Suitable vectors usually have expression control 
sequences, such as promoters, including 3-phosphoglycerate kinase or alcohol 
5 oxidase, and an origin of replication, termination sequences and the like as 
desired. 

A protein of the present invention, once expressed, can be isolated from 
yeast by lysing the cells and applying standard protein isolation techniques to the 
lysates. The monitoring of the purification process can be accomplished by using 
10 Western blot techniques or radioimmunoassay of other standard immunoassay 
techniques. 

The proteins of the present invention can also be constructed using non- 
cellular synthetic methods. Solid phase synthesis of proteins of less than about 
50 amino acids in length may be accomplished by attaching the C-terminal amino 

15 acid of the sequence to an insoluble support followed by sequential addition of the 
remaining amino acids in the sequence. Techniques for solid phase synthesis are 
described by Barany and Merrifield, Solid-Phase Peptide Synthesis, pp. 3-284 in 
The Peptides: Analysis, Synthesis, Biology. Vol. 2: Special Methods in Peptide 
Synthesis, Part A. ; Merrifield, etal., J. Am. Chem. Soc. 85:2149-2156 (1963), and 

20 Stewart et al, Solid Phase Peptide Synthesis, 2nd ed., Pierce Chem. Co., 
Rockford, III. (1984). Proteins of greater length may be synthesized by 
condensation of the amino and carboxy termini of shorter fragments. Methods of 
forming peptide bonds by activation of a carboxy terminal end (e.g., by the use of 
the coupling reagent N,N'-dicycylohexylcarbodiimide)) is known to those of skill. 

25 The proteins of this invention, recombinant or synthetic, may be purified to 

substantial purity by standard techniques well known in the art, including 
detergent solubilization, selective precipitation with such substances as 
ammonium sulfate, column chromatography, immunopurification methods, and 
others. See, for instance, R. Scopes, Protein Purification: Principles and Practice, 

30 Springer-Verlag: New York (1982); Deutscher, Guide to Protein Purification, 

Academic Press (1990). For example, antibodies may be raised to the proteins as 

described herein. Purification from E. coli can be achieved following procedures 

described in U.S. Patent No. 4,511,503. Detection of the expressed protein is 
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achieved by methods known in the art and include, for example, 
radioimmunoassays, Western blotting techniques or immunoprecipitation. 

The present invention further provides a method for modulating (i.e., 
increasing or decreasing) the concentration or composition of the polypeptides of 
5 the present invention in a plant or part thereof. Modulation can be effected by 
increasing or decreasing the concentration and/or the composition (i.e., the ratio 
of the polypeptides of the present invention) in a plant. 

The method comprises transforming a plant cell with an expression 
cassette comprising a polynucleotide of the present invention to obtain a 

10 transformed plant cell, growing the transformed plant cell under conditions 

allowing expression of the polynucleotide in the plant cell in an amount sufficient 
to modulate concentration and/or composition in the plant cell. 

In some embodiments, the content and/or composition of polypeptides of 
the present invention in a plant may be modulated by altering, in vivo or in vitro, 

1 5 the promoter of a non-isolated gene of the present invention to up- or down- 
regulate gene expression. In some embodiments, the coding regions of native 
genes of the present invention can be altered via substitution, addition, insertion, 
or deletion to decrease activity of the encoded enzyme. See, e.g., Kmiec, U.S. 
Patent 5,565,350; Zarling et a/., PCT/US93/03868. One method of down- 

20 regulation of the protein involves using PEST sequences that provide a target for 
degradation of the protein. It has been observed that high levels of LEC1 prevent 
germination. See Lotan et al, Cell 1998 June 26; 93(7): 1195-1205. Thus, 
temporal regulation of LEC1 expression may be desirable in certain species to 
permit proper germination, vegetative growth, flowering and reproduction. 

25 In some embodiments, an isolated nucleic acid (e.g., a vector) comprising a 

promoter sequence is transfected into a plant cell. Subsequently, a plant cell 
comprising the promoter operably linked to a polynucleotide of the present 
invention is selected for by means known to those of skill in the art such as, but 
not limited to, Southern blot, DNA sequencing, or PCR analysis using primers 

30 specific to the promoter and to the gene and detecting amplicons produced 

therefrom. A plant or plant part altered or modified by the foregoing embodiments 
is grown under plant forming conditions for a time sufficient to modulate the 
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concentration and/or composition of polypeptides of the present invention in the 
plant. Plant forming conditions are well known in the art. 

In general, concentration or composition is increased or decreased by at 
least 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, or 90% relative to a 
5 native control plant, plant part, or cell lacking the aforementioned expression 
cassette. Modulation in the present invention may occur during and/or 
subsequent to growth of the plant to the desired stage of development. 
Modulating nucleic acid expression temporally and/or in particular tissues can be 
controlled by employing the appropriate promoter operably linked to a 

10 polynucleotide of the present invention in, for example, sense or antisense 
orientation as discussed in greater detail, supra. Induction of expression of a 
polynucleotide of the present invention can also be controlled by exogenous 
administration of an effective amount of inducing compound. Inducible promoters 
and inducing compounds which activate expression from these promoters are well 

15 known in the art. In preferred embodiments, the polypeptides of the present 
invention are modulated in monocots or dicots, preferably maize, soybeans, 
sunflower, sorghum, canola, wheat, alfalfa, rice, barley and millet. 

Means of detecting the proteins of the present invention are not critical 
aspects of the present invention. In a preferred embodiment, the proteins are 

20 detected and/or quantified using any of a number of well recognized 

immunological binding assays (see, e.g., U.S. Patents 4,366,241; 4,376,110; 
4,517,288; and 4,837,168). For a review of the general immunoassays, see also 
Methods in Cell Biology, Vol. 37; Antibodies in Cell Biology, Asai, Ed., Academic 
Press, Inc. New York (1993); Basic and Clinical Immunology 7th Edition, Stites & 

25 Terr, Eds. (1991). Moreover, the immunoassays of the present invention can be 
performed in any of several configurations, e.g., those reviewed in Enzyme 
Immunoassay, Maggio, Ed., CRC Press, Boca Raton, Florida (1980); Tijan, 
Practice and Theory of Enzyme Immunoassays, Laboratory Techniques in 
Biochemistry and Molecular Biology, Elsevier Science Publishers B.V., 

30 Amsterdam (1985); Harlow and Lane, supra] Immunoassay: A Practical Guide, 

Chan, Ed., Academic Press, Orlando, FL (1987); Principles and Practice of 

Immunoassays, Price and Newman Eds., Stockton Press, NY (1991); and Non- 

isotopic Immunoassays, Ngo, Ed., Plenum Press, NY (1988). 
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Typical methods include Western blot (immunoblot) analysis, analytic 
biochemical methods such as electrophoresis, capillary electrophoresis, high 
performance liquid chromatography (HPLC), thin layer chromatography (TLC), 
hyperdiffusion chromatography, and the like, and various immunological methods 
5 such as fluid or gel precipitin reactions, immunodiffusion (single or double), 
Immunoelectrophoresis, radioimmunoassays (RIAs), enzyme-linked 
immunosorbent assays (ELISAs), immunofluorescent assays, and the like. 

Non-radioactive labels are often attached by indirect means. Generally, a 
ligand molecule (e.g., biotin) is covalently bound to the molecule. The ligand then 

10 binds to an anti-ligand (e.g., streptavidin) molecule which is either inherently 

detectable or covalently bound to a signal system, such as a detectable enzyme, 
a fluorescent compound, or a chemiluminescent compound. A number of ligands 
and anti-ligands can be used. Where a ligand has a natural anti-ligand, for 
example, biotin, thyroxine, and Cortisol, it can be used in conjunction with the 

15 labeled, naturally occurring anti-ligands. Alternatively, any haptenic or antigenic 
compound can be used in combination with an antibody. 

The molecules can also be conjugated directly to signal generating 
compounds, e.g., by conjugation with an enzyme or fluorophore. Enzymes of 
interest as labels will primarily be hydrolases, particularly phosphatases, 

20 esterases and glycosidases, or oxidoreductases, particularly peroxidases. 

Fluorescent compounds include fluorescein and its derivatives, rhodamine and its 
derivatives, dansyl, umbelliferone, etc. Chemiluminescent compounds include 
luciferin, and 2,3-dihydrophthalazinediones, e.g., luminol. For a review of various 
labeling or signal producing systems which may be used, see, U.S. Patent No. 

25 4,391,904, which is incorporated herein by reference. 

Some assay formats do not require the use of labeled components. For 
instance, agglutination assays can be used to detect the presence of the target 
antibodies. In this case, antigen-coated particles are agglutinated by samples 
comprising the target antibodies. In this format, none of the components need be 

30 labeled and the presence of the target antibody is detected by simple visual 
inspection. 

The proteins of the present invention can be used for identifying 

compounds that bind to (e.g., substrates), and/or increase or decrease (i.e., 
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modulate) the enzymatic activity of, catalytically active polypeptides of the present 
invention. The method comprises contacting a polypeptide of the present 
invention with a compound whose ability to bind to or modulate enzyme activity is 
to be determined. The polypeptide employed will have at least 20%, preferably at 
5 least 30% or 40%, more preferably at least 50% or 60%, and most preferably at 
least 70% or 80% of the specific activity of the native, full-length polypeptide of the 
present invention (e.g., enzyme). Methods of measuring enzyme kinetics are well 
known in the art. See, e.g., Segel, Biochemical Calculations, 2 nd ed., John Wiley 
and Sons, New York (1976). 

10 Antibodies can be raised to a protein of the present invention, including 

individual, allelic, strain, or species variants, and fragments thereof, both in their 
naturally occurring (full-length) forms and in recombinant forms. Additionally, 
antibodies are raised to these proteins in either their native configurations or in 
non-native configurations. Anti-idiotypic antibodies can also be generated. Many 

15 methods of making antibodies are known to persons of skill. 

In some instances, it is desirable to prepare monoclonal antibodies from 
various mammalian hosts, such as mice, rodents, primates, humans, etc. 
Description of techniques for preparing such monoclonal antibodies are found in, 
e.g., Basic and Clinical Immunology, 4th ed., Stites et a/., Eds., Lange Medical 

20 Publications, Los Altos, CA, and references cited therein; Harlow and Lane, 
Supra] Goding, Monoclonal Antibodies: Principles and Practice, 2nd ed., 
Academic Press, New York, NY (1986); and Kohler and Milstein, Nature 256:495- 
497 (1975). 

Other suitable techniques involve selection of libraries of recombinant 
25 antibodies in phage or similar vectors (see, e.g., Huse et a/., Science 246:1275- 
1281 (1989); and Ward, et al, Nature 341:544-546 (1989); and Vaughan et a/., 
Nature Biotechnology, 14:309-314 (1996)). Alternatively, high avidity human 
monoclonal antibodies can be obtained from transgenic mice comprising 
fragments of the unrearranged human heavy and light chain Ig loci (i.e., minilocus 
30 transgenic mice). Fishwild etai, Nature Biotech., 14:845-851 (1996). Also, 

recombinant immunoglobulins may be produced. See, Cabilly, U.S. Patent No. 
4,816,567; and Queen etai, Proc. Natl. Acad. Sci. 86:10029-10033 (1989). 



-29- 



The antibodies of this invention can be used for affinity chromatography in 
isolating proteins of the present invention, for screening expression libraries for 
particular expression products such as normal or abnormal protein or for raising 
anti-idiotypic antibodies which are useful for detecting or diagnosing various 
5 pathological conditions related to the presence of the respective antigens. 

Frequently, the proteins and antibodies of the present invention will be 
labeled by joining, either covalently or non-covalently, a substance which provides 
for a detectable signal. A wide variety of labels and conjugation techniques are 
known and are reported extensively in both the scientific and patent literature. 
10 Suitable labels include radionucleotides, enzymes, substrates, cofactors, 

inhibitors, fluorescent moieties, chemiluminescent moieties, magnetic particles, 
and the like. 

Transfection/Transformation of Cells 

15 The method of transformation/transfection is not critical to the instant 

invention; various methods of transformation or transfection are currently 
available. As newer methods are available to transform crops or other host cells 
they may be directly applied. Accordingly, a wide variety of methods have been 
developed to insert a DNA sequence into the genome of a host cell to obtain the 

20 transcription and/or translation of the sequence to effect phenotypic changes in 
the organism. Thus, any method which provides for efficient 
transformation/transfection may be employed. 

A DNA sequence coding for the desired polynucleotide of the present 
invention, for example a cDNA or a genomic sequence encoding a full length 

25 protein, can be used to construct an expression cassette which can be introduced 
into the desired plant. Isolated nucleic acid acids of the present invention can be 
introduced into plants according techniques known in the art. Generally, 
expression cassettes as described above and suitable for transformation of plant 
cells are prepared. 

30 Techniques for transforming a wide variety of higher plant species are well 

known and described in the technical, scientific, and patent literature. See, for 

example, Weising et a/., Ann. Rev. Genet 22:421-477 (1988). For example, the 

DNA construct may be introduced directly into the genomic DNA of the plant cell 

-30- 



using techniques such as electroporation, PEG poration, particle bombardment, 
silicon fiber delivery, or microinjection of plant cell protoplasts or embryogenic 
callus. See, e.g., Tomes, etaL, Direct DNA Transfer into Intact Plant Cells Via 
Microprojectile Bombardment, pp.1 97-21 3 in Plant Cell, Tissue and Organ 
5 Culture, Fundamental Methods, eds. O. L. Gamborg and G.C. Phillips. Springer- 
Verlag Berlin Heidelberg New York, 1995. Alternatively, the DNA constructs may 
be combined with suitable T-DNA flanking regions and introduced into a 
conventional Agrobacterium tumefaciens host vector. The virulence functions of 
the Agrobacterium tumefaciens host will direct the insertion of the construct and 

10 adjacent marker into the plant cell DNA when the cell is infected by the bacteria. 
See, U.S. Patent No. 5,591,616. 

The introduction of DNA constructs using polyethylene glycol precipitation 
is described in Paszkowski et aL } Embo J. 3:2717-2722 (1984). Electroporation 
techniques are described in Fromm et a/., Proc. Natl. Acad. Sci. 82:5824 (1985). 

15 Ballistic transformation techniques are described in Klein et al., Nature 327:70-73 
(1987). 

Agrobacterium tumefaciens-med'ftated transformation techniques are well 
described in the scientific literature. See, for example Horsch et al., Science 
233:496-498 (1984), and Fraley etaL, Proc. Natl. Acad. Sci. 80:4803 (1983). For 

20 instance, Agrobacterium transformation of maize is described in WO 98/32326. 
Agrobacterium transformation of soybean is described in US Pat. No. 5,563,055. 

Other methods of transfection or transformation include (1) Agrobacterium 
rhizogenes-mediated transformation (see, e.g., Lichtenstein and Fuller In: Genetic 
Engineering, Vol. 6, PWJ Rigby, Ed., London, Academic Press, 1987; and 

25 Lichtenstein, C. P., and Draper, J,. In: DNA Cloning, Vol. II, D. M. Glover, Ed., 
Oxford, IRI Press, 1985), Application PCT/US87/02512 (WO 88/02405 published 
Apr. 7, 1988) describes the use of A. rhizogenes strain A4 and its Ri plasmid 
along with A. tumefaciens vectors pARC8 or pARC16 (2) liposome-mediated DNA 
uptake (see, e.g., Freeman etaL, Plant Cell Physiol. 25:1353, (1984)), (3) the 

30 vortexing method (see, e.g., Kindle, Proc. Natl. Acad. Sci. USA 87:1228, (1990)). 

DNA can also be introduced into plants by direct DNA transfer into pollen 

as described by Zhou etaL, Methods in Enzymology, 101:433 (1983); D. Hess, 

Intern Rev. CytoL, 107:367 (1987); Luo etaL, Plane Mol. Biol. Reporter, 6:165 
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(1988). Expression of polypeptide coding polynucleotides can be obtained by 
injection of the DNA into reproductive organs of a plant as described by Pena et 
a/., Nature, 325:274 (1987). DNA can also be injected directly into the cells of 
immature embryos and the rehydration of desiccated embryos as described by 
5 Neuhaus et a/., Theor. Appl. Genet, 75:30 (1987); and Benbrook et a/., in 
Proceedings Bio Expo 1986, Butterworth, Stoneham, Mass., pp. 27-54 (1986). 

Animal and lower eukaryotic (e.g., yeast) host cells are competent or 
rendered competent for transfection by various means. There are several well- 
known methods of introducing DNA into animal cells. These include: calcium 

10 phosphate precipitation, fusion of the recipient cells with bacterial protoplasts 

containing the DNA, treatment of the recipient cells with liposomes containing the 
DNA, DEAE dextran, electroporation, biolistics, and micro-injection of the DNA 
directly into the cells. The transfected cells are cultured by means well known in 
the art. Kuchler, R.J., Biochemical Methods in Cell Culture and Virology, Dowden, 

15 Hutchinson and Ross, Inc. (1977). 

Altering the culture medium to suppress somatic embryogenesis in non- 
transformed plant cells and/or tissues to provide for a positive section 
means of transformed plant cells 

20 Using the following methods for controlling somatic embryogenesis, it is 

possible to alter plant tissue culture media components to suppress somatic 
embryogenesis in a plant species of interest (often having multiple components 
that potentially could be adjusted to impart this effect). Such conditions would not 
impart a negative or toxic in vitro environment for wild-type tissue, but instead 

25 would simply not produce a somatic embryogenic growth form. Introducing a 
transgene such as LEC1 will stimulate somatic embryogenesis and growth in the 
transformed cells or tissue, providing a clear differential growth screen useful for 
identifying transformants. 

Altering a wide variety of media components can modulate somatic 

30 embryogenesis (either stimulating or suppressing embryogenesis depending on 
the species and particular media component). Examples of media components 
which, when altered, can stimulate or suppress somatic embryogenesis include; 
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1) the basal medium itself (macronutrient, micron utrients and vitamins; see T.A. 
Thorpe, 1981 for review, "Plant Tissue Culture: Methods and Applications in 
Agriculture", Academic Press, NY), 

2) plant phytohormones such as auxins (indole acetic acid, indole butyric acid, 
5 2,4-dichlorophenoxyacetic acid, naphthaleneacetic acid, picloram, dicamba 

and other functional analogues), cytokinins (zeatin, kinetin, benzyl amino 
purine, 2-isopentyl adenine and functionally-related compounds) abscisic acid, 
adenine, and gibberellic acid, 

3) and other compounds that exert "growth regulator" effects such as coconut 
10 water, casein hydrolysate, and proline, and 

4) the type and concentration of gelling agent, pH and sucrose concentration. 

Changes in the individual components listed above (or in some cases 
combinations of components) have been demonstrated in the literature to 
modulate in vitro somatic embryogenesis across a wide range of dicotyledonous 
15 and monocotyledonous species. For a compilation of examples, see E.F. George 
etal. 1987. Plant Tissue Culture Media. Vol. 1: Formulations and Uses. 
Exergetics, Ltd., Publ., Edington, England. 

Transgenic Plant Regeneration 

20 Transformed plant cells which are derived by any of the above 

transformation techniques can be cultured to regenerate a whole plant which 
possesses the transformed genotype. Such regeneration techniques often rely on 
manipulation of certain phytohormones in a tissue culture growth medium, 
typically relying on a biocide and/or herbicide marker that has been introduced 

25 together with a polynucleotide of the present invention. For transformation and 
regeneration of maize see, Gordon-Kamm et ai, The Plant Cell, 2:603-618 
(1990). 

Plants cells transformed with a plant expression vector can be regenerated, 

e.g., from single cells, callus tissue or leaf discs according to standard plant tissue 

30 culture techniques. It is well known in the art that various cells, tissues, and 

organs from almost any plant can be successfully cultured to regenerate an entire 

plant. Plant regeneration from cultured protoplasts is described in Evans et ai, 

Protoplasts Isolation and Culture, Handbook of Plant Cell Culture, Macmillan 
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Publishing Company, New York, pp. 124-176 (1983); and Binding, Regeneration 
of Plants, Plant Protoplasts, CRC Press, Boca Raton, pp. 21-73 (1985). 

The regeneration of plants containing the foreign gene introduced by 
Agrobacterium can be achieved as described by Horsch et a/., Science, 
5 227:1229-1231 (1985) and Fraley et a/., Proc. Natl. Acad, Sci. U.S.A. 80:4803 
(1983). This procedure typically produces shoots within two to four weeks and 
these transformant shoots are then transferred to an appropriate root-inducing 
medium containing the selective agent and an antibiotic to prevent bacterial 
growth. Transgenic plants of the present invention may be fertile or sterile. 

10 Regeneration can also be obtained from plant callus, explants, organs, or 

parts thereof. Such regeneration techniques are described generally in Klee et 
al., Ann. Rev. of Plant Phys. 38:467-486 (1987). The regeneration of plants from 
either single plant protoplasts or various explants is well known in the art. See, for 
example, Methods for Plant Molecular Biology, A. Weissbach and H. Weissbach, 

15 eds., Academic Press, Inc., San Diego, Calif. (1988). For maize cell culture and 
regeneration see generally, The Maize Handbook, Freeling and Walbot, Eds., 
Springer, New York (1994); Corn and Corn Improvement, 3 rd edition, Sprague and 
Dudley Eds., American Society of Agronomy, Madison, Wisconsin (1988). 
One of skill will recognize that after the expression cassette is stably 

20 incorporated in transgenic plants and confirmed to be operable, it can be 

introduced into other plants by sexual crossing. Any of a number of standard 
breeding techniques can be used, depending upon the species to be crossed. 

In vegetatively propagated crops, mature transgenic plants can be 
propagated by the taking of cuttings, via production of apomictic seed, or by tissue 

25 culture techniques to produce multiple identical plants. Selection of desirable 
transgenics is made and new varieties are obtained and propagated vegetatively 
for commercial use. In seed propagated crops, mature transgenic plants can be 
self crossed to produce a homozygous inbred plant. The inbred plant produces 
seed containing the newly introduced heterologous nucleic acid. These seeds 

30 can be grown to produce plants that would produce the selected phenotype. 

Parts obtained from the regenerated plant, such as flowers, seeds, leaves, 

branches, fruit, and the like are included in the invention, provided that these parts 

comprise cells comprising the isolated nucleic acid of the present invention. 
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Progeny and variants, and mutants of the regenerated plants are also included 
within the scope of the invention, provided that these parts comprise the 
introduced nucleic acid sequences. 

Transgenic plants expressing a selectable marker can be screened for 
5 transmission of the nucleic acid of the present invention by, for example, standard 
immunoblot and DNA detection techniques. Transgenic lines are also typically 
evaluated on levels of expression of the heterologous nucleic acid. Expression at 
the RNA level can be determined initially to identify and quantitate expression- 
positive plants. Standard techniques for RNA analysis can be employed and 

10 include PCR amplification assays using oligonucleotide primers designed to 
amplify only the heterologous RNA templates and solution hybridization assays 
using heterologous nucleic acid-specific probes. The RNA-positive plants can 
then analyzed for protein expression by Western immunoblot analysis using the 
specifically reactive antibodies of the present invention. In addition, in situ 

15 hybridization and immunocytochemistry according to standard protocols can be 
done using heterologous nucleic acid specific polynucleotide probes and 
antibodies, respectively, to localize sites of expression within transgenic tissue. 
Generally, a number of transgenic lines are usually screened for the incorporated 
nucleic acid to identify and select plants with the most appropriate expression 

20 profiles. 

A preferred embodiment is a transgenic plant that is homozygous for the 
added heterologous nucleic acid; i.e., a transgenic plant that contains two added 
nucleic acid sequences, one gene at the same locus on each chromosome of a 
chromosome pair. A homozygous transgenic plant can be obtained by sexually 

25 mating (selfing) a heterozygous transgenic plant that contains a single added 

heterologous nucleic acid, germinating some of the seed produced and analyzing 
the resulting plants produced for altered expression of a polynucleotide of the 
present invention relative to a control plant (i.e., native, non-transgenic). Back- 
crossing to a parental plant and out-crossing with a non- transgenic plant are also 

30 contemplated. Alternatively, propagation of heterozygous transgenic plants could 

be accomplished through apomixis. 

The present invention provides a method of genotyping a plant comprising 

a polynucleotide of the present invention. Genotyping provides a means of 
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distinguishing homologs of a chromosome pair and can be used to differentiate 
segregants in a plant population. Molecular marker methods can be used for 
phylogenetic studies, characterizing genetic relationships among crop varieties, 
identifying crosses or somatic hybrids, localizing chromosomal segments affecting 
5 monogenic traits, map based cloning, and the study of quantitative inheritance. 
See, e.g., Plant Molecular Biology: A Laboratory Manual, Chapter 7, Clark, Ed., 
Springer-Verlag, Berlin (1997). For molecular marker methods, see generally, 
The DNA Revolution by Andrew H. Paterson 1996 (Chapter 2) in: Genome 
Mapping in Plants (ed. Andrew H. Paterson) by Academic Press/R. G. Landis 

10 Company, Austin, Texas, pp.7-21. 

The particular method of genotyping in the present invention may employ 
any number of molecular marker analytic techniques such as, but not limited to, 
restriction fragment length polymorphisms (RFLPs). RFLPs are the product of 
allelic differences between DNA restriction fragments caused by nucleotide 

15 sequence variability. Thus, the present invention further provides a means to 
follow segregation of a gene or nucleic acid of the present invention as well as 
chromosomal sequences genetically linked to these genes or nucleic acids using 
such techniques as RFLP analysis. 

Plants which can be used in the method of the invention include 

20 monocotyledonous and dicotyledonous plants. Preferred plants include maize, 
wheat, rice, barley, oats, sorghum, millet, rye, soybean, sunflower, alfalfa, canola 
and cotton. 

Seeds derived from plants regenerated from transformed plant cells, plant 
parts or plant tissues, or progeny derived from the regenerated transformed 
25 plants, may be used directly as feed or food, or further processing may occur. 

All publications cited in this application are herein incorporated by 
reference to the same extent as if each individual publication or patent application 
was specifically and individually indicated to be incorporated by reference. 

The present invention will be further described by reference to the following 
30 detailed examples. It is understood, however, that there are many extensions, 
variations, and modifications on the basic theme of the present invention beyond 
that shown in the examples and description, which are within the spirit and scope 
of the present invention. 
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EXAMPLES 



Example 1 

Library construction used for the maize LEC1 EST's 

5 

A. Total RNA Isolation 

Total RNA was isolated from maize embryo and regenerating callus tissues 
with TRIzol Reagent (Life Technology Inc. Gaithersburg, MD) using a modification 
of the guanidine isothiocyanate/acid-phenol procedure described by Chomczynski 

10 and Sacchi (Chomczynski, P., and Sacchi, N. Anal. Biochem. 162, 156 (1987)). 
In brief, plant tissue samples were pulverized in liquid nitrogen before the addition 
of the TRIzol Reagent, and then were further homogenized with a mortar and 
pestle. Addition of chloroform followed by centrifugation was conducted for 
separation of an aqueous phase and an organic phase. The total RNA was 

15 recovered by precipitation with isopropyl alcohol from the aqueous phase. 

B. Poly(A)+ RNA Isolation 

The selection of po!y(A)+ RNA from total RNA was performed using 
PolyATact system (Promega Corporation. Madison, Wl). In brief, biotinylated 
20 oligo(dT) primers were used to hybridize to the 3' poly(A) tails on mRNA. The 

hybrids were captured using streptavidin coupled to paramagnetic particles and a 
magnetic separation stand. The mRNA was washed at high stringent condition 
and eluted by RNase-free deionized water. 

25 C. cDNA Library Construction 

cDNA synthesis was performed and unidirectional cDNA libraries were 

constructed using the Superscript Plasmid System (Life Technology Inc. 

Gaithersburg, MD). The first stand of cDNA was synthesized by priming an 

oligo(dT) primer containing a Not I site. The reaction was catalyzed by 

30 Superscript Reverse Transcriptase II at 45°C. The second strand of cDNA was 

labeled with alpha- 32 P-dCTP and a portion of the reaction was analyzed by 

agarose gel electrophoresis to determine cDNA sizes. cDNA molecules smaller 

than 500 base pairs and unligated adapters were removed by Sephacryl-S400 
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chromatography. The selected cDNA molecules were ligated into pSPORTI 
vector in between of Not I and Sal I sites. 



Example 2 

5 Sequencing and cDNA subtraction procedures used for maize LEC1 EST's 



A. Sequencing Template Preparation 

Individual colonies were picked and DNA was prepared either by PCR with 
M13 forward primers and M13 reverse primers, or by plasmid isolation. All the 
10 cDNA clones were sequenced using M13 reverse primers. 



B. Q-bot Subtraction Procedure 

cDNA libraries subjected to the subtraction procedure were plated out on 
22 x 22 cm 2 agar plate at density of about 3,000 colonies per plate. The plates 

15 were incubated in a 37°C incubator for 12-24 hours. Colonies were picked into 
384-well plates by a robot colony picker, Q-bot (GENETIX Limited). These plates 
were incubated overnight at 37°C. 

Once sufficient colonies were picked, they were pinned onto 22 x 22 cm 2 
nylon membranes using Q-bot. Each membrane contained 9,216 colonies or 

20 36,864 colonies. These membranes were placed onto agar plate with appropriate 
antibiotic. The plates were incubated at 37°C for overnight. 

After colonies were recovered on the second day, these filters were placed 
on filter paper prewetted with denaturing solution for four minutes, then were 
incubated on top of a boiling water bath for additional four minutes. The filters 

25 were then placed on filter paper prewetted with neutralizing solution for four 
minutes. After excess solution was removed by placing the filters on dry filter 
papers for one minute, the colony side of the filters were place into Proteinase K 
solution, incubated at 37°C for 40-50 minutes. The filters were placed on dry filter 
papers to dry overnight. DNA was then cross-linked to nylon membrane by UV 

30 light treatment. 

Colony hybridization was conducted as described by Sambrook, J., Fritsch, 

E.F. and Maniatis, T., (in Molecular Cloning: A laboratory Manual, 2 nd Edition). 

The following probes were used in colony hybridization: 
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1. First strand cDNA from the same tissue from which the library was made to 
remove the most redundant clones. 

2. 48-192 most redundant cDNA clones from the same library based on previous 
sequencing data. 

5 3. 1 92 most redundant cDNA clones in the entire corn sequence database. 

4. A Sal-A20 oligo nucleotide: TCG ACC CAC GCG TCC GAA AAA AAA AAA 
AAA AAA AAA, removes clones containing a poly A tail but no cDNA. 

5. cDNA clones derived from rRNA. 

10 The image of the autoradiography was scanned into computer and the signal 
intensity and cold colony addresses of each colony was analyzed. Re-arraying of 
cold-colonies from 384 well plates to 96 well plates was conducted using Q-bot. 

Example 3 

15 Identification of Maize LEC1 EST's from a Computer Homology Search 

Gene identities were determined by conducting BLAST (Basic Local 
Alignment Search Tool; Altschul, S. F., et a/., (1993) J. Mol. Biol. 215:403-410; 
see also www.ncbi.nlm.nih.gov/BLAST/) searches under default parameters for 

20 similarity to sequences contained in the BLAST "nr" database (comprising all non- 
redundant GenBank CDS translations, sequences derived from the 3-dimensional 
structure Brookhaven Protein Data Bank, the last major release of the SWISS- 
PROT protein sequence database, EMBL, and DDBJ databases). The cDNA 
sequences were analyzed for similarity to all publicly available DNA sequences 

25 contained in the "nr" database using the BLASTN algorithm. The DNA sequences 
were translated in all reading frames and compared for similarity to all publicly 
available protein sequences contained in the "nr" database using the BLASTX 
algorithm (Gish, W. and States, D. J. (1993) Nature Genetics 3:266-272) provided 
by the NCBI. In some cases, the sequencing data from two or more clones 

30 containing overlapping segments of DNA were used to construct contiguous DNA 
sequences. 
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EXAMPLE 4 

Composition of cDNA Libraries Used to Isolate and Sequence 
Additional cDNA Clones 

5 cDNA libraries representing mRNAs from various corn, poppy, soybean 

and Vernonia tissues were prepared (see Table 1). The characteristics of the 
libraries are described below. 

10 TABLE 1 

cDNA Libraries from Corn, Poppy, Soybean and Vernonia 

Tissue 

Corn endosperm 20 days After pollination* 

Prickly poppy developing seeds 

Soybean embryo, 6 to 10 days after flowering 

Soybean embryo, 13 days after flowering 

Soybean embryogenic suspension 2 weeks after 

subculture 

Soybean mature embryo 8 weeks after subculture 
Soybean embryogenic suspension 
Vernonia developing seed* 

*These libraries were normalized essentially as described in U.S. Pat. No. 
15 5,482,845, incorporated herein by reference. 

cDNA libraries were prepared in Uni-ZAP™XR vectors according to the 
manufacturer's protocol (Stratagene Cloning Systems, La Jolla, CA). Conversion 
of the Uni-ZAP™XR libraries into plasmid libraries was accomplished according 

20 to the protocol provided by Stratagene. Upon conversion, cDNA inserts were 
contained in the plasmid vector pBluescript. cDNA inserts from randomly picked 
bacterial colonies containing recombinant pBluescript plasmids were amplified via 
polymerase chain reaction using primers specific for vector sequences flanking 
the inserted cDNA sequences or plasmid DNA was prepared from cultured 

25 bacterial cells. Amplified insert DNAs or plasmid DNAs were sequenced in dye- 
primer sequencing reactions to generate partial cDNA sequences (expressed 
sequence tags or "ESTs"; see Adams, M. D. et aL, (1991) Science 252:1651). 
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The resulting ESTs were analyzed using a Perkin Elmer Model 377 fluorescent 
sequencer. 



EXAMPLE 5 

5 Identification of cDNA Clones obtained from tissue described in Table 1 

ESTs encoding plant transcription factors were identified by conducting 
BLAST (Basic Local Alignment Search Tool; Altschul, S. F., et a/., (1993) J. Mol. 
Biol. 275:403-410; see also www.ncbi.nlm.nih.gov/BLAST/) searches for similarity 

10 to sequences contained in the BLAST "nr" database (comprising all non- 
redundant GenBank CDS translations, sequences derived from the 3-dimensionaI 
structure Brookhaven Protein Data Bank, the last major release of the SWISS- 
PROT protein sequence database, EMBL, and DDBJ databases). The cDNA 
sequences obtained in Example 1 were analyzed for similarity to all publicly 

15 available DNA sequences contained in the "nr" database using the BLASTN 

algorithm provided by the National Center for Biotechnology Information (NCBI). 
The DNA sequences were translated in all reading frames and compared for 
similarity to all publicly available protein sequences contained in the "nr" database 
using the BLASTX algorithm (Gish, W. and States, D. J. (1993) Nature Genetics 

20 3:266-272 and Altschul, Stephen F., et al. (1 997) Nucleic Acids Res. 25:3389- 
3402) provided by the NCBI. 

Example 6 

Identification of protein motifs diagnostic for LEC1 genes. 

25 

To determine the structural requirements for a LEC1 gene, HAP3 homologs 
were identified in our EST database and aligned. By analyzing sequence 
homology amongst the plant HAP3 family of transcriptional activators these 
sequences were observed to fall into a least two distinctive groups. All of the 
30 HAP3 sequences derived from seed or embryo specific libraries form a distinctive 
LEC1 group that suggests a common evolutionary origin (confirmed by 
phylodentrograms). For example within the "B domain" of all plant LEC1 
sequences examined, a highly conserved CCAAT-box binding motif has been 
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found to contain the non-variable residues methionine, proline, isoleucine, 
alanine, asparagine, valine, and isoleucine (MPIANVI). LEC1 genes are highly 
divergent outside of the region spanning the DNA binding and subunit interaction 
motifs. The low levels of homology between these genes make it difficult to 
5 identify these based solely on a hybridization strategy. Using sequences from 
maize, soybean, wheat, prickly poppy, Vernonia, and Arabidopsis a motif 
diagnostic for LEC1 genes was identified, the specific amino acid substitutions for 
these species was clarified (Figure 1) and the positions at which amino acid 
substitutions occur within the LEC1 group was determined (SEQ ID NO: 23). 
10 Using Blast, the motif in SEQ ID NO: 23 was used to correctly distinguish LECI's 
from other closely related plant HAP3 transcriptional activators. 

Example 7 

Transformation and Regeneration of Maize Callus 

15 

Immature maize embryos from greenhouse or field grown High type II 
donor plants were bombarded with a plasmid containing a polynucleotide of the 
invention (LEC1). The LEC1 polynucleotide was operably linked to a constitutive 
promoter such as nos, or an inducible promoter, such as In2, plus a plasmid 
20 containing the selectable marker gene PAT (Wohlleben et a/. (1988) Gene 70:25- 
37) that confers resistance to the herbicide Bialaphos fused to the Green 
Fluorescence protein. Transformation was performed as follows. 

The ears were surface sterilized in 50% Chlorox bleach plus 0.5% Micro 
detergent for 20 minutes, and rinsed two times with sterile water. The immature 
25 embryos were excised and placed embryo axis side down (scutellum side up), 25 
embryos per plate. These were cultured on 560 L medium 4 days prior to 
bombardment in the dark. Medium 560 L is an N6-based medium containing 
Eriksson's vitamins, thiamine, sucrose, 2,4-D, and silver nitrate. The day of 
bombardment, the embryos were transferred to 560 Y medium for 4 hours and 
30 were arranged within the 2.5-cm target zone. Medium 560Y is a high osmoticum 
medium (560L with high sucrose concentration). 

A plasmid vector comprising a polynucleotide of the invention operably 
linked to the selected promoter was constructed. This plasmid DNA plus plasmid 
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DNA containing a PAT selectable marker was precipitated onto 1.1 (am (average 
diameter) tungsten pellets using a CaCI 2 precipitation procedure as follows: 100 jal 
prepared tungsten particles (0.6 mg) in water, 20 [i\ (2 |ug) DNA in TrisEDTA 
buffer (1 jo,g total), 100 \i\ 2.5 M CaC1 2) 40 jd 0.1 M spermidine. 
5 Each reagent was added sequentially to the tungsten particle suspension. 

The final mixture was sonicated briefly. After the precipitation period, the tubes 
were centrifuged briefly, liquid removed, washed with 500 ml 100% ethanol, and 
centrifuged again for 30 seconds. Again the liquid was removed, and 60 pJ 100% 
ethanol was added to the final tungsten particle pellet. For particle gun 
10 bombardment, the tungsten/DNA particles were briefly sonicated and 5 jil spotted 
onto the center of each macrocarrier and allowed to dry about 2 minutes before 
bombardment. 

The sample plates were bombarded at a distance of 8 cm from the 
stopping screen to the tissue, using a Dupont biolistics helium particle gun. All 

15 samples received a single shot at 650 PSI, with a total of ten aliquots taken from 
each tube of prepared particles/DNA. 

Four to 12 hours post bombardment, the embryos were moved to 560P (a 
low osmoticum callus initiation medium similar to 560L but with lower silver 
nitrate), for 3-7 days, then transferred to 560R selection medium, an N6 based 

20 medium similar to 560P containing 3 mg/Iiter Bialaphos, and subcultured every 2 
weeks. Multicellular GFP cell clusters became visible after two weeks and their 
numbers were periodically recorded. After approximately 10 weeks of selection, 
selection-resistant GFP positive callus clones were sampled for PCR and activity 
of the polynucleotide of interest. Positive lines were transferred to 288J medium, 

25 an MS-based medium with lower sucrose and hormone levels, to initiate plant 
regeneration. Following somatic embryo maturation (2-4 weeks), well-developed 
somatic embryos were transferred to medium for germination and transferred to 
the lighted culture room. Approximately 7-10 days later, developing plantlets were 
transferred to medium in tubes for 7-10 days until plantlets were well established. 

30 Plants were then transferred to inserts in flats (equivalent to 2.5" pot) containing 
potting soil and grown for 1 week in a growth chamber, subsequently grown an 
additional 1-2 weeks in the greenhouse, then transferred to Classic™ 600 pots 
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(1 .6 gallon) and grown to maturity. Plants are monitored for expression of the 
polynucleotide of interest. 

Example 8 

5 Ectopic Expression of Maize LEC1 to Induce Somatic Embryogenesis 

Using the genotype High type II as an example, embryos were isolated and 
cultured on 560L medium for 3-5 days. Four to twelve hours before bombardment 
these embryos were transferred to high osmotic 560Y medium. Expression 

10 cassettes containing the LEC1 cDNA were then co-introduced into the scutella of 
these embryos along with an expression cassette containing the Pat gene fused 
to the Green Fluorescent protein using methods described in Example 7. 
Embryos from a single ear were divided evenly between treatments. Four to 12 
hours following bombardment embryos were then transferred back to a low 

15 osmoticum callus initiation medium (560P) and incubated in the dark at 26°C. 
After 3-7 days of culture these embryos were moved to 560R selection medium. 
Cultures were then transferred every two weeks until transformed colonies 
appear. Cultures were also examined microscopically for GFP expression. LEC1 
expression was expected to stimulate adventive embryo formation. This was 

20 apparent when the cultures were compared to controls (transformed without the 
LEC1 cDNA or non-induced). 

A. Ectopic expression of the maize LEC1 polynucleotide in tobacco is 
sufficient to induce somatic embryogenesis in tobacco leaves 

25 A maize LEC1 polynucleotide was placed into an agrobacterium expression 

cassette driven by the maize safener-induced In2 promoter (this promoter is leaky 
and expresses at low levels without induction). Also between the left and right T- 
DNA borders was the bar gene driven by 35S promoter and the Green 
Fluorescence Protein driven by the ubiquitin promoter. A similar construct was 

30 made without the LEC1 polynucleotide to be used as a control. Tobacco leaf 

discs from variety SR1 were co-cultured with Agrobacterium as described by 

Horsch etai (1985, Science 227:1229-1231) except selecting with bialaphos 

rather than kanamycin. Transformants were selected on medium containing 3mg/I 
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bialaphos. Of the numerous transformed shoot obtained, ectopic embryos were 
visible on the leaves of a single LEC1 transformant. None were visible on control 
plants. Although the frequency was low, ectopic somatic embryo formation was 
also reported to be a rare event in Arabidopsis LEC1 transformants (Lotan et al. 
5 1998). 

B. Transformation frequency was improved by LEC1 introduced using 
particle-mediated DNA delivery. 

A series of expression cassettes were made to evaluate the effects of 

10 LEC1 expression on maize transformation. The maize LEC1 polynucleotide was 
placed under the control of the In2 promoter (weakly induced with the auxin levels 
used under normal culture conditions and strongly-induced with safener), the 
barley NUC1 promoter (expressed strongly in the nucellus), the Ubiquitin promoter 
(strongly expressed constitutively), and the nos promoter (weakly expressed 

15 constitutively). A frame-shift version of the ln2:LEC1 cassette was made along 
with an ln2:ZM-NF-YB (designated as ln2:HAP3 henceforth) construct (The maize 
ZM NF-YB is non-LEC1 type of HAP3 transcriptional activator (Li et al Nucleic 
Acids Res. 20:1087-1091) for use as negative controls. All of these constructs 
were co-bombarded with the Pat~GFP fusion construct (designated as PAT~GFP) 

20 into high type II embryos as described in Example 7. Also, as in Example 7, 
immature embryos were harvested from separate ears, and the embryos from 
each ear were divided equally between treatments to account for ear-to-ear 
variability (for example, in an experiment comparing a control plasmid with this 
same plasmid + LEC1, one-half the total embryos from each ear would be used 

25 for each treatment. In some cases the control treatment contained the Pat~GFP 
construct co-bombarded with GUS. Transformation frequency was determined by 
counting the numbers of embryos with large multicellular GFP-positive cells 
clusters using a GFP microscope, and representing these as a percentage of the 
original number of embryos bombarded. No distinction was made between 

30 embryos with single or multiple events. In all cases, the functional LEC1 

expression cassettes increased transformation frequencies over the control 

treatment (the LEC1 expression cassette also increased the incidence of multiple, 

i.e. 2-3, multicellular transgenic clones growing from the same immature embryo, 
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but as stated above we only scored these as a single event, and are providing a 
conservative representation of LECI's ability to improve transformation). For 
example, transformation frequencies in control treatments for three consecutive 
experiments were 5.1 , 7.4 and 0.8%. In balanced side-by-side comparisons for 
5 the same three experiments, transformation frequencies with the LEC1 

polynucleotide (ln2::LEC1::pinll) were 28.8, 25.7 and 12.4%, respectively. In 
addition to increasing the absolute number of transformants recovered from a 
given amount of target tissue, LEC1 transformants appeared earlier than the 
control transformants (suggesting that the LEC1 polynucleotide also stimulated 

10 growth rates). 

As a more stringent control, an expression vector was constructed in which 
a LEC1 gene, frame-shifted immediately after the start codon, was placed behind 
the In2 promoter. In this experiment using embryos from 3 separate ears, the 
transformation frequencies in the control (frame-shifted LEC1) treatments were 

15 2.7, 6.0 and 2.0%, while the transformation frequencies for the LEC1 treatments 
were 62.7, 26 and 42.7%. This demonstrated clearly that expression of the in- 
frame LEC1 polynucleotide was associated with dramatic increases in 
transformation efficiency. 

Increasing the promoter strength (driving LEC1 expression) increased 

20 transformation frequencies. For example, an experiment was performed to 

compare the In2, nos and UBI promoters. Based on our experience with these 
two promoters driving other genes, the In2 promoter (in the absence of an inducer 
other than auxin from the medium) would drive expression at very low levels. The 
nos promoter has been shown to drive moderately-iow levels of transgene 

25 expression (approximately 10- to 30-fold lower than the maize ubiquitin promoter, 
but still stronger than In2 under the culture conditions used in this experiment). 
Two control treatments were used in this experiment; the frame-shifted LEC1 
driven by the In2 promoter, or a maize ln2:HAP3 polynucleotide (a "non-LEC1 
type" representative of the transcriptional factor family to which LEC1 belongs). 

30 Both control treatments resulted in low transformation frequencies. After 3 weeks, 

the transformation frequency for the ln2:frame-shift-LEC1 (FS) treatment was 

4.8%, while for the ln2:HAP3 treatment it was 2%. The ln2:LEC1 , nos:LEC1 and 

UBI:LEC1 treatments resulted in 14%, 28% and 30% transformation frequencies, 
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respectively. Within these treatments there was also an increase in the overall 
frequency of large, rapidly growing calli. For the control treatments, the 
frequencies of large, vigorous GFP+ calli (relative to the starting number of 
embryos) was low (1 .6 and 0% for the frame-shift or ln2:HAP3, respectively). For 
5 the In2, nos and UBI treatments the overall frequencies of large, vigorous calli 
was 4, 13.3 and 20%, respectively. This is consistent with the interpretation that 
increased LEC1 expression resulted in more rapid in vitro growth of transgenic 
tissue. As is typical for transformation experiments scored in this fashion, 
between 3-6 weeks the number of recovered transformants continues to rise. 
10 After 5 weeks (post-bombardment), the frequency of healthy, growing 

transformants was 4.8 and 7.3% for the FS and HAP3 controls, while for the In2, 
nos and UBI-driven LEC1 treatments the frequencies were 22, 29.3 and 35.3%. 

C. Transformation frequency was improved by LEC1 introduced using 
15 agrobacterium. 

The Agrobacterium strains containing the superbinary plasmids described 
in Example 8A were used to transformed High type II embryos. Briefly, colonies 
containing the engineered Agrobacterium were grown to log phase in minimal A 
medium. Log phase cells were collected by centrifugation and resuspended in 

20 561 Q medium (N6 salts, Eriksson's vitamins, 1 .5mg/l 2,4-D , 68.5g/l sucrose, 36g/l 
glucose, plus 20mg/l acetosyringone). Immature embryos, 1.5-2mm in length, 
were excised and immersed in this solution at a concentration of 5 X 10 8 bacterial 
cells/ml. Embryos were vortexed in this medium and allowed to sit for 5 minutes. 
The embryos were then removed and placed on 562P medium (560P medium 

25 with 100mM acetosyringone and incubated at 20°C for 3 days. Embryos were 
moved again to 563N medium (an agar solidified medium similar to 560P with 
100mg/l carbenicillin, 0.5g/l MES and reduced 2,4-D) and cultured at 28°C for 3 
days. Embryos were then moved to 5630 medium (563N medium with 3mg/l 
bialaphos) and transferred thereafter every 14 days to fresh 5630 medium. 

30 Bialaphos resistant GFP+ colonies were counted using a GFP microscope 

and transformation frequencies were determined as described in example 8B. 

Similar to particle gun experiments, transformation frequencies were greatly 

increased in the LEC1 treatment. For example, transformation frequencies for the 
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control treatment across embryos taken from 7 separate ears were 7.1, 40.9, 
11.1, 7.4, 11.5, 12, 30.8, and 16.6%. The side-by-side comparison for the LEC 
treatment (in the same order of ears as above) showed that transformation 
frequencies were 13.5, 47, 55.8, 37.1, 40.6, 30, 57.1 and 40.8%. Averaged 
5 across all 7 ears, the average transformation frequency for the control was 16.6% 
while that of the LEC1 treatment was 40.8%. This represents a substantial 
increase for an already high baseline produced by Agrobacterium-mediated 
transformation. Comparing across ears, it was observed that the beneficial effects 
on transformation frequency were the greatest when the control frequencies were 
10 low. 

D. Transformants were recovered using LEC1 expression under reduced 
auxin levels or in the absence of auxins in the medium, and in the absence 
of herbicide or antibiotic selection. 

15 To determine if LEC1 could be used in a positive selection scheme, particle 

gun transformation experiments were initiated as described in Example 4 and 
transformants were selected on medium with normal auxin levels, or on medium 
with reduced or no auxin, or visually (using GFP) on medium without bialaphos. 
Transformation frequencies were based on the numbers of embryos with one or 

20 more multicellular GFP positive cell clusters. In the first experiment to test this 

concept, there were two treatment variables. The first was that immature embryos 
were bombarded with the control plasmid (UBI:PAT~GFP) or with 
UBI:PAT~GFP+ln2:LEC1 . The second variable was that the bombarded embryos 
were divided onto either normal bialaphos-containing selection medium (with 

25 normal auxin levels of 2 mg/l 2,4-D), or medium with no bialaphos and reduced 
2,4-D levels (0.5 mg/l). As expected from previous results, on bialaphos selection 
the LEC1 treatment resulted in a higher transformation frequency than the control 
(5.7 versus 2.5%). It was also anticipated that the low auxin medium (0.5 mg/I 
2,4-D) would result in reduced growth rates. Consistent with this, for the control 

30 plasmid treatment (UBI:PAT~GFP), recovery of GFP-expressing (fluorescent) 

colonies was reduced relative to highly-effective bialaphos-selection treatment, 

dropping down to 0.6%. In contrast, it appeared that LEC1 expression, through its 

stimulation of embryogenesis, may have compensated for the low auxin 
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environment, providing a growth advantage to the transgenic colonies, and 
maintaining the efficiency of transformant recovery at 4.0% (still in the same range 
as the LEC1/bialaphos-selected treatment). It's clear from this result that the 
inclusion of LEC1 improved colony growth on reduced auxin relative to the 
5 control. 

On medium completely devoid of auxin, colonies were only observed in the 
LEC1 treatment In this experiment, immature embryos were bombarded with 
either the control plasmid (UBI:PAT~GFP) or with UBI:PAT~GFP+ln2:LEC1, and 
then plated either onto 3.0 mg/l bialaphos, 2.0 mg/l 2,4-D medium or onto no- 

10 bialaphos, no 2,4-D medium (in this latter treatment, wild-type maize callus will not 
exhibit embryogenic growth). Again, as expected, the LEC1 polynucleotide 
increased transformation to 22.7% over the control plasmid value of 8% on normal 
auxin-containing, bialaphos selection medium. Also, as expected, no 
transformants were recovered with the control plasmid on medium devoid of 

15 exogenous auxin. Surprisingly, in the LEC1 treated embryos transformants were 
recovered at a 4% frequency (this was still higher than the control plasmid on 
bialaphos selection). 

Even on auxin-containing medium, the LEC1 polynucleotide in combination 
with GFP+ expression can be used to recover transformants without chemical 

20 selection. For example, under these conditions the recovery of transformants was 
relatively efficient (16% compared to 18% for bialaphos selection), but this 
required more diligence than the low- or no-auxin treatments above to separate 
the GFP-expressing colonies from the growing callus population. 

25 E. LEC1 improves the embryogenic phenotype and regeneration capacity of 
inbreds. 

Immature embryos from the inbred PHP38 were isolated, cultured and 

transformed as described in example 4 with the following changes. Embryos were 

initially cultured on 601 H medium (a MS based medium with 0.1mg/l zeatin, 2mg/l 

30 2,4-D, MS and SH vitamins, proline, silver nitrate, extra potassium nitrate, casein 

hydrolysate, gelrite, 10g/l glucose and 20g/l sucrose). Prior to bombardment 

embryos were moved to a high osmoticum medium (modified Duncan's with 2mg/l 

2,4-D and 12% sucrose). Post bombardment, embryos were moved to 601 H 
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medium with 3mg/l bialaphos for two weeks. Embryos were then moved to 601 H 
medium without proline and casein hydrolysate with 3mg/l bialaphos and 
transferred every two weeks. Transformation frequency was determined by 
counting the numbers of bialaphos resistant GFP-positive colonies. Colonies 
5 were also scored on whether they had an embryogenic (regenerable) or non- 
embryogenic phenotype. In PHP38, the LEC1 polynucleotide increased 
transformation frequency and improved the regenerative potential of the callus. 
For example, a balanced experiment (the embryos from each harvested ear were 
divided equally between treatments) was conducted in which PHP38 immature 

10 embryos were bombarded with the control plasmid (UBI::PAT~GFP::pinll) in one 
treatment, with the UBI::PAT~GFP::pinll plasmid + ln2::LEC1, or with the 
UBI::PAT~GFP::pinll plasmid + nuc1::LEC1 (a maize nucellus-specific promoter 
driving LEC1 expression). The frequency of GFP+ calli growing on bialaphos- 
containing media (relative to the starting number of embryos) was determined 6 

15 weeks after bombardment. For the control treatment, the transformation 

frequency was 1.2%, while for the ln2:LEC1 and nuc1::LEC1 treatments the 
transformation frequencies were 3.2 and 2.0% respectively. In addition, the 
presence of the LEC1 polynucleotide appeared to greatly improve the 
regeneration capacity of the recovered transformants. None of the control 

20 transformants (UBI::PAT~GFP::pinll alone) had an embryogenic, regenerable 

phenotype, while the transformants recovered from the ln2:LEC1 and nuc1::LEC1 
treatments all exhibited a more vigorous, embryogenic growth pattern. This has 
been born out in the ability to recover plants. Callus from the ln2:LEC1 and 
nuc1::LEC1 treatments has produced many healthy plants. 

25 

Example 9 

Transient Expression of the LEC1 polynucleotide Product 
to Induce Somatic Embryogenesis 

30 It may be desirable to "kick start" somatic embryogenesis by transiently 

expressing the LEC-1 polynucleotide product. This can be done by delivering 
LEC1 5'capped polyadenylated RNA, expression cassettes containing LEC-1 
DNA, or LEC-1 protein. All of these molecules can be delivered using a biolistics 
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particle gun. For example 5'capped polyadenylated LEC1 RNA can easily be 
made in vitro using Ambion's mMessage mMachine kit. Following the procedure 
outline above RNA is co-delivered along with DNA containing an agronomically 
useful expression cassette. The cells receiving the RNA will immediately form 
5 somatic embryos and a large portion of these will have integrated the agronomic 
gene. Plants regenerated from these embryos can then be screened for the 
presence of the agronomic gene. 

Example 10 

10 Use of the maize LEC1 to induce apomixis 

Maize expression cassettes directing LEC1 expression to the inner 
integument or nucellus can easily be constructed. An expression cassette 
directing expression of the LEC1 polynucleotide to the nucellus was made using 

15 the barley Nuc1 promoter. Embryos were co-bombarded with the selectable 
marker PAT fused to the GFP gene along with the nucellus specific LEC1 
expression cassette described above. Both inbred (PHP38) and GS3 
transformants were obtained and regenerated as described in examples 4 and 5. 
Transformation frequencies were also increased over the control using the 

20 nuc1:LEC1 polynucleotide (see Example 8 above). 

It is anticipated that the regenerated plants will then be capable of 
producing de novo embryos from LEC1 expressing nucellar cells. This is 
complemented by pollinating the ears to promote normal central cell fertilization 
and endosperm development. In another variation of this scheme, nuc1:LEC1 

25 transformations could be done using a FIE-null genetic background which would 
promote both de novo embryo development and endosperm development without 
fertilization (see Ohad etai 1999 The Plant Cell 11:407-415; also pending US 
application Serial No. 60/151575 filed August 31, 1999). Upon microscopic 
examination of the developing embryos it will be apparent that apomixis has 

30 occurred by the presence of embryos budding off the nucellus. In yet another 

variation of this scheme the LEC1 polynucleotide could be delivered as described 
above into a homozygous zygotic-embryo-lethal genotype. Only the adventive 
embryos produced from somatic nucellus tissue would develop in the seed. 
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EXAMPLE 11 
Expression of Chimeric Genes in Microbial Cells 



The cDNAs encoding the instant transcription factors can be inserted into the 
5 T7 E. coli expression vector pBT430. This vector is a derivative of pET-3a 

(Rosenberg etal. (1987) Gene 56:125-135) which employs the bacteriophage T7 
RNA po!ymerase/T7 promoter system. Plasmid pBT430 was constructed by first 
destroying the EcoR I and Hind III sites in pET-3a at their original positions. An 
oligonucleotide adaptor containing EcoR I and Hind III sites was inserted at the 

10 BamH I site of pET-3a. This created pET-3aM with additional unique cloning sites 
for insertion of genes into the expression vector. Then, the Nde I site at the 
position of translation initiation was converted to an Nco I site using 
oligonucleotide-directed mutagenesis. The DNA sequence of pET-3aM in this 
region, 5-CATATGG, was converted to S'-CCCATGG in pBT430. 

15 Plasmid DNA containing a cDNA may be appropriately digested to release a 

nucleic acid fragment encoding the protein. This fragment may then be purified 
on a 1% NuSieve GTG™ low melting agarose gel (FMC). Buffer and agarose 
contain 10 jag/ml ethidium bromide for visualization of the DNA fragment. The 
fragment can then be purified from the agarose gel by digestion with GELase™ 

20 (Epicentre Technologies) according to the manufacturer's instructions, ethanol 
precipitated, dried and resuspended in 20 jaL of water. Appropriate 
oligonucleotide adapters may be ligated to the fragment using T4 DNA ligase 
(New England Biolabs, Beverly, MA). The fragment containing the ligated 
adapters can be purified from the excess adapters using low melting agarose as 

25 described above. The vector pBT430 is digested, dephosphorylated with alkaline 
phosphatase (NEB) and deproteinized with phenol/chloroform as described 
above. The prepared vector pBT430 and fragment can then be ligated at 16°C 
for 15 hours followed by transformation into DH5 electrocompetent cells (GIBCO 
BRL). Transformants can be selected on agar plates containing LB media and 

30 100 i^g/mL ampicillin. Transformants containing the polynucleotide encoding the 
transcription factor are then screened for the correct orientation with respect to 
the T7 promoter by restriction enzyme analysis. 
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For high level expression, a plasmid clone with the cDNA insert in the correct 
orientation relative to the T7 promoter can be transformed into E. coli strain 
BL21(DE3) (Studier et al. (1986) J. MoL Biol. 189'A 13-130). Cultures are grown in 
LB medium containing ampicillin (100 mg/L) at 25°C. At an optical density at 600 
5 nm of approximately 1, IPTG (isopropylthio-p-galactoside, the inducer) can be 
added to a final concentration of 0.4 mM and incubation can be continued for 3 
hours at 25°C. Cells are then harvested by centrifugation and re-suspended in 50 
IliL of 50 mM Tris-HCI at pH 8.0 containing 0.1 mM DTT and 0.2 mM phenyl 
methylsulfonyi fluoride. A small amount of 1 mm glass beads can be added and 
10 the mixture sonicated 3 times for about 5 seconds each time with a microprobe 
sonicator. The mixture is centrifuged and the protein concentration of the 
supernatant determined. One pg of protein from the soluble fraction of the culture 
can be separated by SDS-polyacrylamide gel electrophoresis. Gels can be 
observed for protein bands migrating at the expected molecular weight. 

15 

EXAMPLE 12 

Evaluating Compounds for Their Ability to Inhibit the Activity 
of Plant Transcription Factors 

20 The transcription factors described herein may be produced using any 

number of methods known to those skilled in the art. Such methods include, but 
are not limited to, expression in bacteria as described in Example 6, or expression 
in eukaryotic cell culture, in plants, and using viral expression systems in suitably 
infected organisms or cell lines. The instant transcription factors may be 

25 expressed either as mature forms of the proteins as observed in vivo or as fusion 
proteins by covalent attachment to a variety of enzymes, proteins or affinity tags. 
Common fusion protein partners include glutathione S-transferase ("GST"), 
thioredoxin ("Trx"), maltose binding protein, and C- and/or N-terminal 
hexahistidine polypeptide ("(His) 6 "). The fusion proteins may be engineered with 

30 a protease recognition site at the fusion point so that fusion partners can be 

separated by protease digestion to yield intact mature enzyme. Examples of such 
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proteases include thrombin, enterokinase and factor Xa. However, any protease 
can be used which specifically cleaves the peptide connecting the fusion protein 
and the enzyme. 

Purification of the instant transcription factors, if desired, may utilize any 
5 number of separation technologies familiar to those skilled in the art of protein 
purification. Examples of such methods include, but are not limited to, 
homogenization, filtration, centrifugation, heat denaturation, ammonium sulfate 
precipitation, desalting, pH precipitation, ion exchange chromatography, 
hydrophobic interaction chromatography and affinity chromatography, wherein the 

10 affinity ligand represents a substrate, substrate analog or inhibitor. When the 
transcription factors are expressed as fusion proteins, the purification protocol 
may include the use of an affinity resin which is specific for the fusion protein tag 
attached to the expressed enzyme or an affinity resin containing ligands which are 
specific for the enzyme. For example, a transcription factor may be expressed as 

15 a fusion protein coupled to the C-terminus of thioredoxin. In addition, a (His) 6 

peptide may be engineered into the N-terminus of the fused thioredoxin moiety to 
afford additional opportunities for affinity purification. Other suitable affinity resins 
could be synthesized by linking the appropriate ligands to any suitable resin such 
as Sepharose-4B. In an alternate embodiment, a thioredoxin fusion protein may 

20 be eluted using dithiothreitol; however, elution may be accomplished using other 
reagents which interact to displace the thioredoxin from the resin. These reagents 
include p-mercaptoethanol or other reduced thiol. The eluted fusion protein may 
be subjected to further purification by traditional means as stated above, if 
desired. Proteolytic cleavage of the thioredoxin fusion protein and the enzyme 

25 may be accomplished after the fusion protein is purified or while the protein is still 
bound to the ThioBond™ affinity resin or other resin. 

Crude, partially purified or purified enzyme, either alone or as a fusion 
protein, may be utilized in assays for the evaluation of compounds for their ability 
to inhibit enzymatic activition of the transcription factors disclosed herein. Assays 

30 may be conducted under well-known experimental conditions that permit optimal 
enzymatic activity. 
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Example 13 

LEC1 expression resulted in increased growth rates, which could be used 
as a screening criterion for positive selection of transformants. 



5 Using two promoters of increasing strength to drive LEC1 expression in 

maize, it appeared that LEC1 stimulated callus growth over control treatments and 
the stronger promoter driving LEC1 resulted in faster growth than with the low- 
level promoter. For example, an experiment was performed to compare the In2 
and nos promoters. As noted above, based on our experience with these two 

10 promoters driving other genes, the In2 promoter (in the absence of an inducer 

other than auxin from the medium) would drive expression at very low levels. The 
nos promoter has been shown to drive moderately-low levels of transgene 
expression (approximately 10- to 30-fold lower than the maize ubiquitin promoter, 
but still stronger than In2 under the culture conditions used in this experiment). 

15 One control treatment was used in this experiment, the UBI:PAT~GFPmo:pinll 
construct by itself (with no LEC1). Hi-ll immature embryos were bombarded as 
previously described, and transgenic, growing events were scored at 3 and 6 
weeks. The control treatment resulted in a transformation frequency of 0.8%. 
The ln2:LEC1 and nos:LEC1 treatments resulted in transformation frequencies of 

20 26.5 and 40.7%, respectively. 

Within these treatments there was also an increase in the overall frequency of 
large, rapidly growing calli, relative to the control treatment. For this data, the 
fresh weight of transformed calli were recorded 2 months after bombardment. 
Assuming that all the transgenic events started as single transformed cells within 

25 a few days after bombardment, these weights represent the relative growth rate of 
these transformants during this period (all tissue was sub-cultured and weighed 
for each transformant; mean weights and standard deviations were calculated for 
each treatment). For the control treatment, the mean transformant weight after 
two months was 37 +/- 15 mg (n=6). For the ln2:LEC1 and nos:LEC1 treatments, 

30 the mean transformant weights were 126 +/- 106 and 441 +/- 430 mg, 

respectively. If the control treatment was set at a relative growth value of 1 .0, this 
means that transformants in the ln2:LEC1 and nos:LEC1 treatments grew 3.4 and 
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12-fold faster than the control. From this data, it appears that increasing LEC1 
expression resulted in a concomitant increase in callus growth rate. 

Example 14 

5 The use of LEC1 polynucleotide as a positive selection system for wheat 
transformation and for improving the regeneration capacity of wheat tissues 

Method 
Plant Material 

10 Seeds of wheat Hybrinova lines NH535 and BO 014 were sown into soil in 

plug trays for vernalisation at 6°C for eight weeks. Vernalized seedlings were 
transferred in 8" pots and grown in a controlled environment room. The growth 
conditions used were; 1) soil composition: 75% L&P fine-grade peat, 12% 
screened sterilized loam, 10% 6 mm screened, lime-free grit, 3% medium grade 

15 vermiculite, 3.5 kg Osmocote per m 3 soil (slow-release fertiliser, 15-11-13 NPK 
plus micronutrients), 0.5 kg PG mix per m 3 (14-1 6-1 8 NPK granular fertiliser plus 
micronutrients, 2) 16 h photoperiod (400W sodium lamps providing irradiance of 
ca. 750 jaE s 1 nr 2 ), 18 to 20°C day and 14 to 16°C night temperature, 50 to 70% 
relative air humidity and 3) pest control: sulphur spray every 4 to 6 weeks and 

20 biological control of thrips using Amblyseius caliginosus (Novartis BCM Ltd, UK). 

Isolation of Explants and Culture Initiation 

Two sources of primary explants were used; scutellar and inflorescence 
tissues. For scutella, early-medium milk stage grains containing immature 

25 translucent embryos were harvested and surface-sterilized in 70% ethanol for 5 
min and 0.5% hypochlorite solution for 15-30 min. For inflorescences, tillers 
containing 0.5-1.0 cm inflorescences were harvested by cutting below the 
inflorescence-bearing node (the second node of a tiller). The tillers were trimmed 
to approximately 8-10 cm length and surface-sterilized as above with the upper 

30 end sealed with Nescofilm (Bando Chemical Ind. Ltd, Japan). 

Under aseptic conditions, embryos of approximately 0.5-1 .0 mm length 

were isolated and the embryo axis removed. Inflorescences were dissected from 

the tillers and cut into approximately 1 mm pieces. Thirty scutella or 1 mm 
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inflorescence explants were placed in the center (18 mm target circle) of a 90 mm 
Petri dish containing MD0.5 or L7D2 culture medium. Embryos were placed with 
the embryo-axis side in contact with the medium exposing the scutellum to 
bombardment whereas inflorescence pieces were placed randomly. Cultures 
5 were incubated at 25±°C in darkness for approximately 24 h before bombardment. 
After bombardment, explants from each bombarded plate were spread across 
three plates for callus induction. 

Culture Media 

1 0 The standard callus induction medium for scutellar tissues (MD0.5) 

consisted of solidified (0.5% Agargel, Sigma A3301) modified MS medium 
supplemented with 9% sucrose, 10 mg I^AgNOs and 0.5 mg I" 1 2,4-D (Rasco- 
Gaunt era/., 1999). Inflorescence tissues were cultured on L7D2 which consisted 
of solidified (0.5% Agargel) L3 medium supplemented with 9% maltose and 2 mg 

15 T 1 2,4-D (Rasco-Gaunt and Barcelo, 1999). The basal shoot induction medium, 
RZ contained L salts, vitamins and inositol, 3% w/v maltose, 0.1 mg V 2,4-D and 5 
mg I -1 zeatin (Rasco-Gaunt and Barcelo, 1999). Regenerated plantlets were 
maintained in RO medium with the same composition as RZ, but without 2,4-D 
and zeatin. 

20 

DNA Precipitation Procedure and Particle Bombardment 

Submicron gold particles (0.6 urn Micron Gold, Bio-Rad) were coated with a 
plasmid containing the maize ln-2:LEC1 construct following the protocol modified 
from the original Bio-Rad procedure (Barcelo and Lazzeri, 1995). The standard 

25 precipitation mixture consisted of 1 mg of gold particles in 50 pi SDW, 50 pi of 2.5 
M calcium chloride, 20 pi of 100 mM spermidine free base and 5 pi DNA 
(concentration 1 ug pi" 1 ). After combining the components, the mixture was 
vortexed and the supernatant discarded. The particles were then washed with 
150 pi absolute ethanol and finally resuspended in 85 pi absolute ethanol. The 

30 DNA/gold ethanol solution was kept on ice to minimise ethanol evaporation. For 
each bombardment, 5 pi of DNA/gold ethanol solution (ca. 60 jag gold) was loaded 
onto the macrocarrier. 
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Particle bombardments were carried out using DuPont PDS 1000/He gun 
with a target distance of 5.5 cm from the stopping plate at 650 psi acceleration 
pressure and 28 in. Hg chamber vacuum pressure. 



5 Regeneration of Transformants 

For callus induction, bombarded explants were distributed over the surface 
of the medium in the original dish and two other dishes and cultured at 25+1 °C in 
darkness for three weeks. Development of somatic embryos from each callus 
were periodically recorded. For shoot induction, calluses were transferred to RZ 

10 medium and cultured under 12 h light (250 s~ 1 rrf 2 , from cool white fluorescent 
tubes) at 25+1 °C for three weeks for two rounds. All plants regenerating from the 
same callus were noted. Plants growing more vigorously than the control cultures 
were potted in soil after 6-9 weeks in R0 medium. The plantlets were acclimatized 
in a propagator for 1-2 weeks. Thereafter, the plants were grown to maturity 

15 under growth conditions described above. 

DNA Isolation from Callus and Leaf Tissues 

Genomic DNA was extracted from calluses or leaves using a modification 
of the CTAB (cetyltriethylammonium bromide, Sigma H5882) method described by 

20 Stacey and Isaac (1994). Approximately 100-200 mg of frozen tissues was 

ground into powder in liquid nitrogen and homogenised in 1 ml of CTAB extraction 
buffer (2% CTAB, 0.02 M EDTA, 0.1 M Tris-CI pH 8, 1.4 M NaCI, 25 mM DTT) for 
30 min at 65°C. Homogenised samples were allowed to cool at room temperature 
for 15 min before a single protein extraction with approximately 1 ml 24:1 v/v 

25 chloroform:octanol was done. Samples were centrifuged for 7 min at 13,000 rpm 
and the upper layer of supernatant collected using wide-mouthed pipette tips. 
DNA was precipitated from the supernatant by incubation in 95% ethanoi on ice 
for 1 h. DNA threads were spooled onto a glass hook, washed in 75% ethanoi 
containing 0.2 M sodium acetate for 10 min, air-dried for 5 min and resuspended 

30 in TE buffer. Five pi RNAse A was added to the samples and incubated at 37°C 
for 1 h. 
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For quantification of genomic DNA, gel electrophoresis was performed 
using an 0.8% agarose gel in 1x TBE buffer. One microlitre of the samples were 
fractionated alongside 200, 400, 600 and 800 ng pi 1 X uncut DNA markers. 



5 Polymerase chain reaction (PCR) analysis 

The presence of the maize LEC1 polynucleotide was analyzed by PCR 
using 100-200 ng template DNA in a 30 ml PCR reaction mixture containing 1X 
concentration enzyme buffer (10 mM Tris-HCI pH 8.8, 1.5 mM magnesium 
chloride, 50 mM potassium chloride, 0.1% Triton X-100), 200 uM dNTPs, 0.3 uM 

1 0 primers and 0.022 U TaqDNA polymerase (Boehringer Mannheim). 

Thermocycling conditions were as follows (30 cycles): denaturation at 95°C for 30 
s, annealing at 55°C for 1 min and extension at 72°C for 1 min. Primer sequences 
(F=forward; R=reverse) used were: (F) 5'-CGC TCT GTC ACC TGT TGT ACT C-3' 
(R) 5-CGT GAT GAA GCT GAT GTA CTC C-3'. Approximate PCR product length 

15 was 620 bp. 



Results 

Following experiments to show increased regeneration capacity and 
improvement of maize transformation frequencies by expression of maize LEC1 , the 

20 polynucleotide was then introduced into wheat scutellar and inflorescence explants, 
driven by the maize In2 promoter. Both tissues are used for wheat transformation. 

Subsequentto the induction of somatic embryos from both tissues after three 
weeks on a 2,4-D-containing induction medium, calluses were assessed prior to 
transfer onto shoot regeneration medium. Callus assessment involved: a) scoring 

25 calluses as 0 = non-embryogeniccallus, 1 = 25%, 2 = 25-50%, 3 = 50-75%, 4 = 75- 
1 00% of callus surface embryogenic, and b) determining embryogenic capacity 
expressed in percentage as the number of embryogenic calluses /total number of 
calluses (scutella or inflorescence) assessed. 



30 Scutellum calluses 

Mean callus scores of control (1 .4 ± 0.3) and LEC-bombarded (1 .4 ± 0.3) 
scutellar tissues of wheat line NH535 were not significantlydifferent. However, 
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callus score of LEC-bombarded scutella (1 .5 ± 0.5) of wheat line BO 014 was 
significantly improved in comparison with the control (0.5 ± 0.2). Similarly, 
embryogenic capacity of line NH535 did not seem to be affected by LEC 
treatment (LEC calluses = 84.3 ± 9.3%, control calluses = 90%). However, LEC- 
5 bombarded line BO 014 had clear increases in the embryogenic callus frequency 
(LEC calluses = 75.4 ± 16.8%, control calluses = 36.7 ± 4.7%). Examining the 
quality of embryogenic calluses formed, both lines showed significant increases in 
the number of 'good' calluses produced i.e. calluses with scores of 3 or 4. 'Good' 
quality callus of line NH535 increased from 5 to 22.3 % whilst line BO 014 
10 increased from 0 to 23.6%. These calluses were generally large, rapidly growing 
and vigorous. 

After callus induction and assessment, calluses were transferred onto shoot 
induction media for a total of six weeks. Shoot regeneration of calluses was 
determined, as the number of shoot regenerating calluses / total number of calluses 

1 5 assessed (expressed as percentages). Shoot regeneration of cultures 

corresponded with the quality and quantity of somatic embryos produced in each 
callus. Hence, regeneration of LEC-bombarded (71 .9 ±12.1) and control (70 ± 14) 
callus tissues of line NH535 were not significantly different. However, regeneration 
of LEC-bombarded calluses (52.3 ± 26.9) of wheat line BO 014 was significantly 

20 improved in comparison with the control (1 5.6 ±6.3). 

To test the suitability of LEC as a positive selection system for wheat, 
sample tissues from vigorous calluses were analyzed for the presence of LEC 
sequences. Forty-one BO 014 and 13 NH535 calluses were selected. The 
results were that 10/41 BO 014 calluses and 8/13 NH535 were PCR positive. 

25 Thus, transformed lines were identified without selection at frequencies of 24.4% 
and 61 .5%. These frequencies are comparable with conventional selection 
systems such as herbicide- and antibiotic-resistance systems (e.g. bar, nptH) 
applied in wheat transformation where selection 'escape' frequencies are 
commonly high and variable. Furthermore, we know of no other report of wheat 

30 transformation by morphological selection in the absence of a selection agent. 
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Callus transformation frequencies were 5.6% and 4.4% in NH535 and BO 
014 lines, respectively. Transgenic plants were also recovered from LEC-positive 
callus lines. Seven non-clonal plants were recovered from NH535 and six non- 
clonal plants were recovered from BO 014 to give plant transformation 
5 frequencies of 3.9 and 3.3%, respectively, based on the number of explants 
bombarded. 

Inflorescence calluses 

The use of inflorescence tissues as explants for the tissue culture and 
1 0 transformation of wheat offer several advantages over seed explants such as 

scutella (Rasco-Gaunt and Barcelo, 1 999). However, responses of these tissues to 
culture are highly genotype-dependentand calluses are often non-regenerative 
despite having a 'highly-embryogenic'appearance. Hence, LEC was introduced 
into inflorescence tissues to see whether regeneration could be enhanced on a 
1 5 poorly regenerating line such BO 014. 

Using line BO 014, shoot regeneration was significantly improved in LEC- 
bombarded tissues, although callus quality appeared similar in bombarded and 
control tissues. Whereas no shoot was regenerated from control cultures, eight 
plants were regenerated from LEC-bombarded calluses to give a shoot 
20 regeneration frequency of 1 0.7%. 

Tables 



Summary of tissue culture data 



Wheat 
line 


Treatment 


Mean 
callus 
score 
(0-4) 


Embryo. 
Capacity 

(%) 


% Good 
calluses 
(score 
3&4) 


Regeneration 
(%) 


NH 535 


Control 


1.4 ±0.3 


90.0 


5 


70 ± 14 




LEC1 


1.4 ±0.3 


84.3 ± 9.3 


22.3 


71.9 ± 12.1 


BO 014 


Control 


0.5 ±0.2 


36.7 ± 4.7 


0 


15.6 ±6.3 




LEC1 


1.5 + 0.5 


75.4 ± 
16.8 


23.6 


52.3 ±26.9 
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Type of callus produced per treatment per line 



5 



Callus Score 


NH535 


BO0 14 




Control 


LEC1 


Control 


LEC1 


0 


10 


15.7 


63.2 


25 


1 


45 


43.3 


26.3 


30 


2 


40 


28.7 


10.5 


21.4 


3 


5 


11.2 


0 


17.9 


4 


0 


1.1 


0 


5.7 


Transformation Frequency 


Wheat line 


Callus line 


Plant line (non-clonal) 


NH 535 


10/180 (5.6%) 


7/180 (3.9%) 


BO 014 


8/180 (4.4%) 


6/180 (3.3%) 



10 

EXAMPLE 15 
Expression of Chimeric Genes in Dicot Cells 

The LEC1 polynucleotide can also be used to improve the transformation of 
15 soybean. To demonstrate this the construct consisting of the In2 promoter and 
LEC1 coding region were introduced into embryogenic suspension cultures of 
soybean by particle bombardment using essentially the methods described in 
Parrott, W.A., L.M. Hoffman, D.F. Hildebrand, E.G. Williams, and G.B. Collins, 
(1989) Recovery of primary transformants of soybean, Plant Cell Rep. 7:615-617. 
20 This method with modifications is described below. 

Seed was removed from pods when the cotyledons were between 3 and 5 
mm in length. The seeds were sterilized in a Chlorox solution (0.5%) for 15 
minutes after which time the seeds were rinsed with sterile distilled water. The 
immature cotyledons were excised by first cutting away the portion of the seed 
25 that contains the embryo axis. The cotyledons were then removed from the seed 
coat by gently pushing the distal end of the seed with the blunt end of the scalpel 
blade. The cotyledons were then placed (flat side up) SB1 initiation medium (MS 
salts, B5 vitamins, 20 mg/L 2,4-D, 31.5 g/l sucrose, 8 g/L TC Agar, pH 5.8). The 
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Petri plates were incubated in the light (16 hr day; 75-80 ^E) at 26° C. After 4 
weeks of incubation the cotyledons were transferred to fresh SB1 medium. After 
an additional two weeks, globular stage somatic embryos that exhibit proliferative 
areas were excised and transferred to FN Lite liquid medium (Samoylov, V.M., 
D M. Tucker, and W.A. Parrott (1998) Soybean [Glycine max (L.) Merrill] 
embryogenic cultures: the role of sucrose and total nitrogen content on 
proliferation. In Vitro Cell Dev. Biol.- Plant 34:8-13). About 10 to 12 small clusters 
of somatic embryos were placed in 250 ml flasks containing 35 ml of SB172 
medium. The soybean embryogenic suspension cultures were maintained in 35 
ml_ liquid media on a rotary shaker, 150 rpm, at 26°C with florescent lights (20 j^E) 
on a 16:8 hour day/night schedule. Cultures were sub-cultured every two weeks 
by inoculating approximately 35 mg of tissue into 35 ml_ of liquid medium. 

Soybean embryogenic suspension cultures were then be transformed using 
particle gun bombardment (Klein etal. (1987) Nature (London) 327:70, U.S. 
Patent No. 4,945,050). A BioRad Biolistic™ PDS1000/HE instrument was used 
for these transformations. A selectable marker gene which was used to facilitate 
soybean transformation is a chimeric gene composed of the 35S promoter from 
Cauliflower Mosaic Virus (Odell etal. (1985) Nature 373:810-812), the hygromycin 
phosphotransferase gene from plasmid pJR225 (from E. coli; Gritz etal. (1983) 
Gene 25:179-188) and the 3' region of the nopaline synthase gene from the 
T-DNA of the Ti plasmid of Agrobacterium tumefaciens. 

To 50 nL of a 60 mg/mL 1 ^m gold particle suspension was added (in order): 
5 nL DNA (1 i^g/jaL), 20 jlx! spermidine (0.1 M), and 50 jaL CaCI 2 (2.5 M). The 
particle preparation was agitated for three minutes, spun in a microfuge for 10 
seconds and the supernatant removed. The DNA-coated particles were washed 
once in 400 j^L 70% ethanol and resuspended in 40 ]iL of anhydrous ethanol. 
The DNA/particle suspension was sonicated three times for one second each. 
Five jj-L of the DNA-coated gold particles were then loaded on each macro carrier 
disk. 

Approximately 300-400 mg of a two-week-old suspension culture was placed 
in an empty 60x15 mm petri dish and the residual liquid removed from the tissue 
with a pipette. Membrane rupture pressure was set at 1 100 psi and the chamber 
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was evacuated to a vacuum of 28 inches mercury. The tissue was placed 
approximately 8 cm away from the retaining screen, and was bombarded three 
times. Following bombardment, the tissue was divided in half and placed back 
into 35 ml of FN Lite medium. 

Five to seven days after bombardment, the liquid medium was exchanged 
with fresh medium. Eleven days post bombardment the medium was exchanged 
with fresh medium containing 50 mg/mL hygromycin. This selective medium was 
refreshed weekly. Seven to eight weeks post bombardment, green, transformed 
tissue was observed growing from untransformed, necrotic embryogenic clusters. 
Isolated green tissue was removed and inoculated into individual flasks to 
generate new, clonally propagated, transformed embryogenic suspension 
cultures. Each new line was treated as an independent transformation event. 
These suspensions were then subcultured and maintained as clusters of 
immature embryos, or tissue was regenerated into whole plants by maturation and 
germination of individual embryos. 

Two different genotypes were used in these experiments: 92B91 and 
93B82. Samples of tissue were either bombarded with the hygromycin resistance 
gene alone or with a 1:1 mixture of the hygromycin resistance gene and the LEC1 
construct. Embryogenic cultures generated from 92B91 generally produce 
transformation events while cultures from 93B82 are much more difficult to 
transform. For transformation experiments with 92B91 , approximately equal 
numbers of transformants were recovered from bombardments conducted with the 
LEC1 polynucleotide as without it. Twenty-nine transformants were recovered 
from the LEC1 -treated 92B91 tissue while 27 transformants were recovered from 
tissue receiving only the hygromycin resistance gene. In contrast, transformants 
were only recovered from 93B82 tissue receiving the LEC1 polynucleotide (none 
were recovered from the treatment using only the hygromycin resistance gene). 
Five transformants were recovered from 93B82 tissue bombarded with the LEC1 
polynucleotide while no transformants were recovered from tissue treated with 
only the hygromycin resistance gene. These results show that the LEC1 
polynucleotide will be very valuable for gene transfer to recalcitrant genotypes of 
soybean, 
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WHAT IS CLAIMED IS 



isolated nucleic acid comprising a member selected from the group 
consisting of: 

(a) a polynucleotide which encodes a polypeptide of SEQ ID NO: 
2, 8, 10, 12, 14, 16, 18, 20, or 22; 

(b) a polynucleotide amplified from a plant nucleic acid library 
using the primers of SEQ ID NOS: 3 and 4, 5 and 6, 9 and 1 0, 
or 1 1 and 12 or primers determined by using Vector nti Suite, 

I nforMax Version 5. 

(c) a polynucleotide comprising at least 20 contiguous bases of 
SEQ ID NO: 1, 7, 9, 11, 13, 15, 17, 19, or21; 

(d) a polynucleotide having a plant HAP3-type ccaat-box 
transcriptional activator, wherein the polynucleotide is from a plant 
other than Arabidopsis] 

(e) a polynucleotide having at least 60% sequence identity to SEQ 
ID NO: 1, 9, 11, 13, 17, or 21 or 65% sequence identity to SEQ 
ID NO: 15 or 19 or 70% sequence identity to SEQ ID NO: 7, 
wherein the % sequence identity is based on the entire 
sequence and is determined by GAP analysis using Gap 
Weight of 50 and Length Weight of 3; 

(f) a polynucleotide comprising at least 25 nucleotides in length 
which hybridizes under high stringency conditions to a 
polynucleotide having the sequence set forth in SEQ ID NO: 
1,7, 9, 11, 13, 15, 17, 19, or21; 

(g) a polynucleotide containing a conserved region coding the 
polypeptide of SEQ ID NO: 23, wherein the polynucleotide is 
from a plant other than Arabidopsis] 

(h) a polynucleotide having the sequence set forth in SEQ ID NO: 
1,7, 9,11,13,15,17,19, or21;and 

(i) a polynucleotide complementary to a polynucleotide of (a) 
through (h). 
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2. The isolated nucleic acid of claim 1 , wherein the transcriptional activator in 
(d) is from corn, soybean, wheat, rice, Veronia, orArgemone. 

3. The isolated nucleic acid of claim 1 wherein the polynucleotide in (f) is a 
5 functional equivalent. 

4. A vector comprising at least one nucleic acid of claim 1 . 

5. An expression cassette comprising at least one nucleic acid of claim 1 
10 operably linked to a promoter, wherein the nucleic acid is in sense or 

antisense orientation. 

6. The expression cassette of claim 5, wherein the nucleic acid is operably 
linked in antisense orientation to the promoter. 

15 

7. A host cell containing at least one expression cassette of claim 5. 

8. The host cell of claim 7 that is a plant cell. 

20 9. A transgenic plant comprising at least one expression cassette of claim 5. 

10. The transgenic plant of claim 9, wherein the plant is corn, soybean, 
sorghum, wheat, rice, alfalfa, sunflower, canola or cotton. 

25 11. A seed from the transgenic plant of claim 9. 

12. The seed from the transgenic plant of claim 1 0. 

13. An isolated protein comprising a member selected from the group 
30 consisting of: 

(a) a polypeptide comprising at least 25 contiguous amino acids of SEQ 
ID NO: 2, 8, 10, 12, 14, 16, 18, 20, or 22; 
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(b) 



a polypeptide which is a LEC1 transcriptional activator containing 
the conserved motifs of SEQ ID NO: 23, wherein the polypeptide is 
from a plant other than Arabidopsis] 

a polypeptide comprising at least 60% sequence identity to SEQ ID 
NO: 2, 12, 14, 16, 20, or 22 or 70 % sequence identity to SEQ ID NO: 
8, 10, or 18, wherein the % sequence identity is based on the entire 
sequence and is determined by GAP analysis using Gap Weight of 12 
and Length Weight of 4; 

a polypeptide encoded by a nucleic acid of claim 1; 

a polypeptide encoded by a nucleic acid of SEQ ID NO: 1, 7, 9, 11, 

13, 15, 17, 19, or 21; and 

a polypeptide having the sequence set forth in SEQ ID NO: 2, 8, 10, 
12, 14, 16, 18, 20, or 22. 



(c) 



5 



10 



(d) 
(e) 



(f) 



15 14. The protein of claim 13, wherein the transcriptional activator of (b) is a 
LEC1 protein from corn, soybeans, wheat, rice, Veronia, or Argemone. 

15. A ribonucleic acid sequence encoding a protein of claim 13. 

20 16. A method for modulating LEC1 activity in a plant, comprising: 

(a) transforming a plant cell with at least one expression cassette of 
claim 5 and 

(b) regenerating a transformed plant that expresses the at least one 
nucleic acid in an amount sufficient to modulate LEC1 activity in the 

25 plant. 

17. The method of claim 16, wherein the plant is corn, soybean, sorghum, 



wheat, rice, alfalfa, sunflower, canola or cotton. 



30 18. The method of claim 16, wherein LEC1 activity is increased. 



19. The method of claim 16, wherein LEC1 activity is decreased. 
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20. A method for transiently modulating the level of LEC1 activity in plant cells 
comprising introducing at least one nucleic acid of claim 1. 

21 . The method of claim 20 wherein the at least one nucleic acid is a 
5 ribonucleic acid. 

22. A method for transiently modulating the level of LEC1 activity in plant cells 
comprising introducing at least one polypeptide of claim 13. 

10 23. A method for enhancing tissue culture response in a plant cell comprising 
introducing into the plant cell at least one LEC1 polypeptide or at least one 
LEC1 polynucleotide under conditions sufficient to enhance tissue culture 
response. 

15 24. The method of claim 23 wherein the plant cell is transformed with at least 
one LEC1 polynucleotide. 

25. The method of claim 23 wherein at least one polynucleotide is operably 
linked to a promoter driving expression in the plant cell and growing the 

20 plant cell. 

26. The method of claim 23 wherein the plant cell is a recalcitrant cell. 

28. The method of claim 26 wherein the plant cell is an inbred plant cell. 

25 

29. A method for inducing somatic embryogenesis in a plant cell comprising 
introducing into a responsive plant cell at least one LEC1 polypeptide or at 
least one LEC1 polynucleotide, wherein the responsive plant cell is 
transformed and grown under conditions sufficient to stimulate the 

30 production of a transformed embryo and wherein the plant cell is other than 

an Arabidopsis cell. 
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30. The method of claim 29 wherein the plant cell is transformed with at least 
one LEC1 polynucleotide. 

31 . The method of claim 30 wherein at least one polynucleotide is operably 
5 linked to a promoter driving expression in the plant cell. 

32. The method of claim 29 further comprising growing the transformed embryo 
under plant growing conditions to produce a regenerated plant. 

10 33. The method of claim 29 wherein the plant cell is from corn, soybean, 
sorghum, wheat, rice, alfalfa, sunflower, canola or cotton. 

34. A plant produced by the method of claim 29. 

15 35. A method for positive selection of a transformed cell comprising introducing 
into a responsive plant cell at least one LEC1 polynucleotide or at least one 
LEC1 polypeptide and growing the transformed plant cell, wherein the 
responsive plant cell is transformed and grown under conditions sufficient 
to induce embryogenesis to provide a positive selection means. 

20 

36. The method of claim 35 wherein the plant cell is transformed with at least 
one polynucleotide. 

37. The method of claim 36 wherein at least one polynucleotide is operably 
25 linked to a promoter driving expression in a plant. 

38. The method of claim 35 further comprising altering media components to 
favor the growth of transformed plant cells. 

30 39. The method of claim 35 wherein the media components are altered to 
reduce somatic embryogenesis in non-transformed cells. 



40. The method of claim 37 wherein the polynucleotide is excised. 

-69- 



41. 



The method of claim 40 wherein the polynucleotide is flanked by FRT 
sequences to allow FLP mediated excision of the polynucleotide. 



42. A method for inducing apomixis in a plant cell comprising introducing into a 
5 responsive plant cell at least one LEC1 polypeptide or at least one LEC1 

polynucleotide and growing the plant cell, wherein the introducing and 
growing is done under conditions sufficient to produce a transformed 
somatic embryo. 



1 0 43. The method of claim 42 wherein the plant cell is transformed with at least 
one LEC1 polynucleotide. 

44. The method of claim 45 wherein the at least one polynucleotide is operably 
linked to a promoter driving expression in a plant cell 

15 

45. The method of claim 42 further comprising suppressing in the plant cell the 
expression of an FIE polycomb polynucleotide using sense or antisense 
methods . 



20 46. The method of claim 42 further comprising growing the embryo under plant 
growing conditions to produce a regenerated plant. 

47. The method of claim 43 wherein the promoter is an inducible promoter. 

25 48. The method of claim 43 wherein the at least one polynucleotide is 
expressed in integument or nucellus tissue. 

49. A plant produced by the method of claim 42. 

30 50. The plant of claim 49, wherein the plant is male sterile. 
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51 . A method for increasing transformation efficiency comprising introducing at 
least one LEC1 polypeptide or at least one LEC1 polynucleotide into a 
responsive plant cell under conditions sufficient to increase transformation 
efficiency. 

5 

52. The method of claim 51 wherein the transformation is conducted in medium 
that retards growth of somatic embryo growth in non-transformed plants. 

53. The method of claim 52 wherein transformation is conducted with reduced 
1 0 levels of auxin or no auxin. 

54. The method of claim 51 wherein the plant cell is transformed with at least 
one LEC1 polynucleotide. 

15 55. The method of claim 54 wherein the at least one polynucleotide is operably 
linked to a promoter driving expression in a plant cell 

56. The method of claim 51 wherein the plant cell is a recalcitrant cell. 

20 57. The method of claim 56 wherein the plant cell is an inbred cell. 

58. A method for increasing recovery of regenerated plants comprising 
introducing into a responsive plant cell at least one LEC1 polypeptide or at 
least one LEC1 polynucleotide and growing the plant cell, wherein the 

25 introducing and growing are under conditions sufficient to produce a 

regenerated plant. 

59. The method of claim 58 wherein the plant cell is transformed with at least 
one LEC1 polynucleotide. 



30 



60. The method of claim 59 wherein the at least one polynucleotide is operably 
linked to a promoter driving expression in a plant cell 
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61 . The method of claim 58 wherein the plant cell is a recalcitrant cell. 

62. The method of claim 61 wherein the plant cell is an inbred plant cell. 
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ABSTRACT OF THE DISCLOSURE 

The invention provides isolated nucleic acids and their encoded proteins 
which act as transcriptional activators and methods of use thereof. The invention 
further provides expression cassettes, transformed host cells, transgenic plants 
and plant parts, and antibody compositions. 
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Figure 1 



Arabidopsis REQDQYKPIAlfVIRIMEKTLPSHAKISDDAKETX^ 

Soybean REQDQYXPIANVIRIMKR ILPAHAKISBDAKETIQECVS E^ISFITAEAI^RCQREQRKTVTAEDVLWAMEKLGFDNYAHPLSLYLKRYR 

Soybean REQDRFKPIANVIRIMSKILPPHAKIStJDAKETXQECVSEYISFI^ 

Soybean REQDRFHPIANVIRIMRKILPPHAKISGDAKETXQECVSEYISFITGEANERC^ 

Ma i z e REQDRLHPIANVIR IHERVLPAKAKI SDDAKETIQECVSEYI SFITGEA*IERCQREQRKTI TAEDVLWAHSRLGFDDYVEPLGAYLHRYR 

A.mexicana REQDRYMPIANVIRIMKKVLPTHAKISDDAKETXQECTSEYISFITSEAK^ 

Vemonia REQDRFT4PIANVIRIMSKILPP~JUCISDD^ 

Wheat REQDRLMPIJ^IRIMSRALPAEAKISDDAK^ 

Mai z e REQDRLKPVANVSRIMRQVLPPYAKI SPDAXEVX QELLFGISSLSWRGETKRCHTERRKTVTSEDIWAMSPXGFDDYVAPLGAFLQRMR 
Maize ARGKTVTSEDIVWAMSSLGFDDYVAPLGAFLQRMR 



SEQUENCE LISTING 



<110> Lowe, Keith S. 

Gordon -Kamm, William J. 
Klein, Theodore M. 
Rasco-Gaunt, Sonriza 
Cahoon, Rebecca E. 
Sun, Xifan 
Hoerster, George J. 
Gregory, Carolyn A. 
Nadimpall i , Ramgopal 

<120> Transcriptional Activator Nucleic Acids, 
Polypeptides, and Methods of Use Thereof 



<130> 0943 

<150> 60/107,643 
<151> 1998-11-09 

<160> 23 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 1173 
<212> DNA 
<213> Zea mays 

<220> 

<221> CDS 

<222> (69) . . . (902) 

<400> 1 

ccacgcgtcc gccaccacac cacgagcgcg cgataaccct agctagcttc aggtagtagc 60 
gagagcca atg gac tec age age ttc etc cct gec gee ggc gcg gag aat 110 
Met Asp Ser Ser Ser Phe Leu Pro Ala Ala Gly Ala Glu Asn 
15 10 

ggc teg gcg gcg ggc ggc gec aac aat ggc ggc get get cag cag cat 158 
Gly Ser Ala Ala Gly Gly Ala Asn Asn Gly Gly Ala Ala Gin Gin His 
15 20 25 30 

gcg gcg ccg gcg ate cgc gag cag gac egg ctg atg ccg ate gcg aac 2 06 

Ala Ala Pro Ala lie Arg Glu Gin Asp Arg Leu Met Pro lie Ala Asn 
35 40 45 

gtg ate cgc ate atg egg cgc gtg ctg ccg gcg cac gee aag ate teg 254 
Val lie Arg lie Met Arg Arg Val Leu Pro Ala His Ala Lys lie Ser 
50 55 60 

gac gac gee aag gag acg ate cag gag tgc gtg teg gag tac ate age 3 02 

Asp Asp Ala Lys Glu Thr lie Gin Glu Cys Val Ser Glu Tyr lie Ser 
65 70 75 

ttc ate acg ggg gag gee aac gag egg tgc cag egg gag cag cgc aag 35 0 

Phe lie Thr Gly Glu Ala Asn Glu Arg Cys Gin Arg Glu Gin Arg Lys 
80 85 90 



acc ate acc gec gag gac gtg ctg tgg gee atg age cgc etc ggc ttc 398 

1 



Thr lie Thr Ala Glu Asp Val Leu Trp Ala Met Ser Arg Leu Gly Phe 
95 100 105 110 



gac gac tac gtc gag ccg etc ggc gec tac etc cac cgc tac cgc gag 446 
Asp Asp Tyr Val Glu Pro Leu Gly Ala Tyr Leu His Arg Tyr Arg Glu 
115 120 125 

ttc gag ggc gac gcg cgc ggc gtc ggg etc gtc ccg ggg gec gec cca 494 
Phe Glu Gly Asp Ala Arg Gly Val Gly Leu Val Pro Gly Ala Ala Pro 
130 135 140 

teg cgc ggc ggc gac cac cac ccg cac tec atg teg cca gcg gcg atg 542 
Ser Arg Gly Gly Asp His His Pro His Ser Met Ser Pro Ala Ala Met 
145 150 155 

etc aag tec cgc ggg cca gtc tec gga gee gee atg eta ccg cac cac 590 
Leu Lys Ser Arg Gly Pro Val Ser Gly Ala Ala Met Leu Pro His His 
160 165 170 

cac cac cac cac gac atg cag atg cac gee gee atg tac ggg gga acg 638 
His His His His Asp Met Gin Met His Ala Ala Met Tyr Gly Gly Thr 
175 180 185 190 

gec gtg ccc ccg ccg gee ggg cct cct cac cac ggc ggg ttc etc atg 686 
Ala Val Pro Pro Pro Ala Gly Pro Pro His His Gly Gly Phe Leu Met 
195 200 205 

cca cac cca cag ggt agt age cac tac ctg cct tac gcg tac gag ccc 734 
Pro His Pro Gin Gly Ser Ser His Tyr Leu Pro Tyr Ala Tyr Glu Pro 
210 215 220 

acg tac ggc ggt gag cac gee atg get gca tac tat gga ggc gee gcg 782 
Thr Tyr Gly Gly Glu His Ala Met Ala Ala Tyr Tyr Gly Gly Ala Ala 
225 230 235 

tac gcg ccc ggc aac ggc ggg age ggc gac ggc agt ggc agt ggc ggc 83 0 

Tyr Ala Pro Gly Asn Gly Gly Ser Gly Asp Gly Ser Gly Ser Gly Gly 
240 245 250 

ggt ggc ggg age gcg teg cac aca ccg cag ggc age ggc ggc ttg gag 878 
Gly Gly Gly Ser Ala Ser His Thr Pro Gin Gly Ser Gly Gly Leu Glu 
255 260 265 270 

cac ccg cac ccg ttc gcg tac aag tagctagttc gtacgtcgtt cgacttgagc 932 
His Pro His Pro Phe Ala Tyr Lys 
275 

aagecatega tetgetgate tgaacgtacg ctgtattgta cacgcatgca egtaegtate 992 

ggeggctage tctcctgttt aagttgtact gtgattctgt cccggccggc tagcaactta 1052 

gtatcttcct tcagtctcta gtttcttagc agtcgtagaa gtgttcaatg ettgecagtg 1112 

tgttgtttta gggccggggt aaaccatccg atgagattat ttcaaaaaaa aaaaaaaaaa 1172 

a 1173 

<210> 2 
<211> 278 
<212> PRT 
<213> Zea mays 

<400> 2 

Met Asp Ser Ser Ser Phe Leu Pro Ala Ala Gly Ala Glu Asn Gly Ser 

15 10 15 

Ala Ala Gly Gly Ala Asn Asn Gly Gly Ala Ala Gin Gin His Ala Ala 



2 



20 25 30 



Pro 


Ala 


lie 


Arg 


Glu 


Gin 


Asp 


Arg 


Leu 


Met 


Pro 


lie 


Ala 


Asn 


Val 


lie 






35 










40 










45 










lie 


Met 


Arg 


Arg 


Val 


Leu 


Pro 


Ala 


His 


Ala 


Lvs 


lie 


Ser 


Asp 


Asp 




50 










55 










60 










Ala 


Lys 


Glu 


Thr 


lie 


Gin 


Glu 


Cys 


Val 


Ser 


Glu 


Tyr 


lie 


Ser 


Phe 


He 


65 










70 










75 










80 


X 11 J. 




Glu 


Ala 


Asn 


Glu Arg 


Cys 


Gin 


Arcr 


Glu 


Gin 


Arg 


Lys 


Thr 


He 










85 










90 










95 




Thr 


Ala 


Glu 


Asp 


Val 


Leu 


Trp 


Ala 


Met 


Ser 


Arg 


Leu 


Gly 


Phe 


Asp 


Asp 








100 










105 










110 






Tyr 


V ClJL 






Leu 


Gly Ala 


Tyr 


Leu 


His 


Arg 


Tyr 


Arg 


Glu 


Phe 


Glu 






i i t: 

_L -L J 










120 










125 








Pi ir 

tjj.y 


Asp 


Ala 


Arg 


Gly Val 


Gly 


Leu 


Val 


Pro 


Gly Ala Ala 


Pro 


Ser 


Arg 




13 0 










135 










140 










ro -i r 

uiy 


\jiy 


ASp 


U-j Q 

IllD 


His 


Pro 


His 


Ser 


Met 


Ser 


Pro 


Ala 


Ala 


Met 


Leu 


Lys 


1 AC 










150 










155 










160 


Ser 


Arg 


ij-xy 


Pro 


Val 


Ser 


Gly Ala 


Ala 


Met 


Leu 


Pro 


His 


His 


His 


His 










165 










170 










175 




llJLS 


nlS 


Asp 




Gin 


Met 


His 


Ala 


Ala 


Met 


Tyr 


Gly Gly 


Thr 


Ala 


Val 








180 










185 










190 






Pit o 


Pro 


ri (J 


Ala 


Gly 


Pro 


Pro 


His 


His 


Gly 


Gly 


Phe 


Leu 


Met 


Pro 


His 






X. -7 ZJ 










200 










205 








Pro 




uiy 


Ser 


Ser 


His 


Tyr 


Leu 


Pro 




Ala 


Tyr 


Glu 


Pro 


Thr 


Tvr 




210 










215 










220 










Gly 


Gly 


Glu 


His 


Ala 


Met 


Ala 


Ala 


Tyr 


Tyr 


Gly Gly Ala Ala 


Tyr 


Ala 


225 










230 










235 










240 


Pro 


Gly 


Asn 


Gly Gly 


Ser 


Gly Asp 


Gly 


Ser 


Gly 


Ser 


Gly Gly 


Gly 


Gly 










245 










250 










255 




Gly 


Ser 


Ala 


Ser 


His 


Thr 


Pro 


Gin Gly 


Ser 


Gly Gly 


Leu 


Glu 


His 


Pro 








260 










265 










270 






His 


Pro 


Phe 


Ala 


Tyr 


Lys 























275 

<210> 3 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 3 

tagtagegag agccaatgga 2 0 

<210> 4 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 4 

geegggacag aatcacagta 2 0 

<210> 5 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 



3 



<223> primer 



<400> 5 

tagtagcgag agccaatgga 2 0 

<210> 6 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 6 

cccggcccta aaacaacaca 20 

<210> 7 
<211> 481 
<212> DNA 

<213> Argemone mexicana 

<220> 

<221> CDS 

<222> (44) . . . (481) 

<221> misc_f eature 
<222> (1) . . . (481) 
<223> n = A,T,C or G 

<400> 7 

cgagagaaag agttggtgaa gaagaagaag aagttgaaaa gag atg gaa cgt ggt 55 

Met Glu Arg Gly 
1 

ggt ggt ggt ggt ggt agt ggt ggt ggt ttc cat gga tat cag aaa etc 103 
Gly Gly Gly Gly Gly Ser Gly Gly Gly Phe His Gly Tyr Gin Lys Leu 
5 10 15 20 

cca aaa tea aac tec get gga atg atg etc teg gag eta teg aat aac 151 
Pro Lys Ser Asn Ser Ala Gly Met Met Leu Ser Glu Leu Ser Asn Asn 
25 30 35 

aac aac aat att gac gta aac tct aca tgt act gta cga gag caa gat 199 
Asn Asn Asn lie Asp Val Asn Ser Thr Cys Thr Val Arg Glu Gin Asp 
40 45 50 

cga tac atg cca att get aat gtg ate agg ate atg cgt aag gta ctt 24 7 

Arg Tyr Met Pro lie Ala Asn Val lie Arg lie Met Arg Lys Val Leu 
55 60 65 

cct act cat gec aag ate tct gac gat gee aaa gaa act ate caa gaa 295 
Pro Thr His Ala Lys He Ser Asp Asp Ala Lys Glu Thr He Gin Glu 
70 75 80 

tgt gtc tea gaa tac ate agt ttc ate aca agt gaa gec aat gat cgt 34 3 

Cys Val Ser Glu Tyr He Ser Phe He Thr Ser Glu Ala Asn Asp Arg 
85 90 95 100 

tgc caa cgt gaa caa aga aag aca ate aca get gaa gat gtt tta tgg 391 
Cys Gin Arg Glu Gin Arg Lys Thr He Thr Ala Glu Asp Val Leu Trp 
105 110 115 



4 



gcg atg age aaa eta ggg ntt gat gag tac att gaa cct eta act ctt 43 9 

Ala Met Ser Lys Leu Gly Xaa Asp Glu Tyr lie Glu Pro Leu Thr Leu 
120 125 130 

tac ctt caa cgt tat cgt gag ttt gaa ggt gna cgt tgg tea 4 81 

Tyr Leu Gin Arg Tyr Arg Glu Phe Glu Gly Xaa Arg Trp Ser 
135 140 145 



<210> 8 
<211> 146 
<212> PRT 

<213> Argemone mexicana 
<220> 

<221> VARIANT 

<222> (1) . . . (146) 

<223> Xaa = Any Amino Acid 





<400> 


8 


























Met 


Glu 


Arg 


Gly Gly Gly 


Gly Gly Gly 


Ser 


Gly Gly Gly 


Phe 


His 


Gly 


1 








5 










10 










15 




Tyr 


Gin 


Lys 


Leu 


Pro 


Lys 


Ser 


Asn 


Ser 


Ala 


Gly Met 


Met 


Leu 


Ser 


Glu 








20 










25 










30 






Leu 


Ser 


Asn 
35 


Asn 


Asn 


Asn 


Asn 


He 
40 


Asp 


Val 


Asn 


Ser 


Thr 
45 


Cys 


Thr 


Val 


Arg 


Glu 
50 


Gin 


Asp 


Arg 


Tyr 


Met 
55 


Pro 


He 


Ala 


Asn 


Val 
60 


He 


Arg 


He 


Met 


Arg 


Lys 


Val 


Leu 


Pro 


Thr 


His 


Ala 


Lys 


He 


Ser 


Asp 


Asp 


Ala 


Lys 


Glu 


65 










70 










75 










80 


Thr 


He 


Gin 


Glu 


Cys 
85 


Val 


Ser 


Glu 


Tyr 


He 
90 


Ser 


Phe 


He 


Thr 


Ser 
95 


Glu 


Ala 


Asn 


Asp 


Arg 
100 


Cys 


Gin 


Arg 


Glu 


Gin 
105 


Arg 


Lys 


Thr 


He 


Thr 
110 


Ala 


Glu 


Asp 


Val 


Leu 
115 


Trp 


Ala 


Met 


Ser 


Lys 
120 


Leu 


Gly 


Xaa 


Asp 


Glu 
125 


Tyr 


He 


Glu 


Pro 


Leu 
130 


Thr 


Leu 


Tyr 


Leu 


Gin 
135 


Arg 


Tyr 


Arg 


Glu 


Phe 
140 


Glu 


Gly 


Xaa 


Arg 



Trp Ser 
145 



<210> 9 

<211> 942 

<212> DNA 

<213> Glycine max 

<220> 

<221> CDS 

<222> (3) . . . (722) 



<400> 9 

gc acg age tct ctt ata ate aca cac aca cct acc tta ata get atg 4 7 

Thr Ser Ser Leu He He Thr His Thr Pro Thr Leu He Ala Met 
15 10 15 

gaa act gga ggc ttt cac ggc tac cgc aag etc ccc aac acc acc get 95 
Glu Thr Gly Gly Phe His Gly Tyr Arg Lys Leu Pro Asn Thr Thr Ala 
20 25 30 



ggg ttg aag ctg tea gtg tea gac atg aac atg agg cag cag gta gca 143 
Gly Leu Lys Leu Ser Val Ser Asp Met Asn Met Arg Gin Gin Val Ala 
35 40 45 



tea tea gat cac agt gca gec aca gga gag gag aac gaa tgc acg gtg 191 
Ser Ser Asp His Ser Ala Ala Thr Gly Glu Glu Asn Glu Cys Thr Val 
50 55 60 

agg gag caa gac agg ttc atg cca ate gec aac gtg att agg ate atg 23 9 

Arg Glu Gin Asp Arg Phe Met Pro lie Ala Asn Val lie Arg lie Met 
65 70 75 

cgc aag att etc cct cca cac gca aaa ate teg gac gat gca aaa gaa 287 
Arg Lys lie Leu Pro Pro His Ala Lys lie Ser Asp Asp Ala Lys Glu 
80 85 90 95 

aca ate caa gag tgc gtg tct gag tac ate age ttc ate aca ggt gag 335 
Thr lie Gin Glu Cys Val Ser Glu Tyr lie Ser Phe lie Thr Gly Glu 
100 105 110 

gcg aac gag cgt tgc cag agg gag cag egg aag acc ata acc gca gag 383 
Ala Asn Glu Arg Cys Gin Arg Glu Gin Arg Lys Thr lie Thr Ala Glu 
115 120 125 

gac gtg ctt tgg gee atg age aag ctt gga ttc gac gac tac ate gaa 431 
Asp Val Leu Trp Ala Met Ser Lys Leu Gly Phe Asp Asp Tyr lie Glu 
130 135 140 

ccg ttg acc atg tac ctt cac cgc tac cgt gaa ctt gag ggt gac cgc 47 9 

Pro Leu Thr Met Tyr Leu His Arg Tyr Arg Glu Leu Glu Gly Asp Arg 
145 150 155 



acc tct atg agg ggt gaa cca etc ggg aag agg act gtg gaa tac gee 
Thr Ser Met Arg Gly Glu Pro Leu Gly Lys Arg Thr Val Glu Tyr Ala 
160 165 170 175 

acg ctt ggt gtt get act get ttt gtc cct cca ccc tat cat cac cac 
Thr Leu Gly Val Ala Thr Ala Phe Val Pro Pro Pro Tyr His His His 
180 185 190 



527 



575 



aat ggg tac ttt ggt get gec atg ccc atg ggg act tac gtt agg gaa 623 
Asn Gly Tyr Phe Gly Ala Ala Met Pro Met Gly Thr Tyr Val Arg Glu 
195 200 205 

gcg cca cca aat aca gee tec tec cat cac cac cac cac cac cac cac 671 
Ala Pro Pro Asn Thr Ala Ser Ser His His His His His His His His 
210 215 220 

cac cat get cgt gga ate tec aat get cat gaa cca aat get cgc tec 719 
His His Ala Arg Gly lie Ser Asn Ala His Glu Pro Asn Ala Arg Ser 
225 230 235 

ata taaaattata taattatgac taggattcag aacaagactt gatgatgatt 772 

He 

240 

agcttaactc tcagtaattg gtgctagagt actactgttg ttgaggatac tttattttat 832 

aattaagggc tgggaaggga gttagtatat tcctaatcct aactatgtgc atctttaatt 892 

tatgaaatca ctttgtttta acctttgatg aaaaaaaaaa aaaaaaaaaa 942 

<210> 10 

<211> 240 

<212> PRT 

<213> Glycine max 



6 



<400> 10 

Thr Ser Ser Leu He He Thr His Thr Pro Thr Leu He Ala Met Glu 

15 10 15 

Thr Gly Gly Phe His Gly Tyr Arg Lys Leu Pro Asn Thr Thr Ala Gly 

20 25 30 

Leu Lys Leu Ser Val Ser Asp Met Asn Met Arg Gin Gin Val Ala Ser 

35 40 45 

Ser Asp His Ser Ala Ala Thr Gly Glu Glu Asn Glu Cys Thr Val Arg 

50 55 60 

Glu Gin Asp Arg Phe Met Pro He Ala Asn Val He Arg He Met Arg 
65 70 75 80 

Lys He Leu Pro Pro His Ala Lys He Ser Asp Asp Ala Lys Glu Thr 

85 90 95 

He Gin Glu Cys Val Ser Glu Tyr He Ser Phe He Thr Gly Glu Ala 

100 105 110 

Asn Glu Arg Cys Gin Arg Glu Gin Arg Lys Thr He Thr Ala Glu Asp 

115 120 125 

Val Leu Trp Ala Met Ser Lys Leu Gly Phe Asp Asp Tyr He Glu Pro 

130 135 140 

Leu Thr Met Tyr Leu His Arg Tyr Arg Glu Leu Glu Gly Asp Arg Thr 
145 150 155 160 

Ser Met Arg Gly Glu Pro Leu Gly Lys Arg Thr Val Glu Tyr Ala Thr 

165 170 175 

Leu Gly Val Ala Thr Ala Phe Val Pro Pro Pro Tyr His His His Asn 

180 185 190 

Gly Tyr Phe Gly*Ala Ala Met Pro Met Gly Thr Tyr Val Arg Glu Ala 

195 200 205 

Pro Pro Asn Thr Ala Ser Ser His His His His His His His His His 

210 215 220 

His Ala Arg Gly He Ser Asn Ala His Glu Pro Asn Ala Arg Ser He 
225 230 235 240 

<210> 11 
<211> 905 
<212> DNA 

<213> Veronia mespilif olia 

<220> 

<221> CDS 

<222> (58) . . . (699) 

<400> 11 

gcacgagcca atttctagag agagaacgag agagaattct ctaaagagga aaaatag atg 60 

Met 
1 

gaa cgt gga gga ggt ttc cat ggc tac cac agg etc ccc ate cac cct 108 
Glu Arg Gly Gly Gly Phe His Gly Tyr His Arg Leu Pro He His Pro 
5 10 15 

aca tct gga ate caa caa teg gat atg aag eta aag eta cca gaa atg 156 
Thr Ser Gly He Gin Gin Ser Asp Met Lys Leu Lys Leu Pro Glu Met 
20 25 30 

ace aac aat aac teg tec act gat gac aat gag tgc ace gtt cga gaa 2 04 

Thr Asn Asn Asn Ser Ser Thr Asp Asp Asn Glu Cys Thr Val Arg Glu 
35 40 45 

cag gac cgc ttc atg ccg ata gca aac gtg ate cgc ate atg egg aag 252 
Gin Asp Arg Phe Met Pro He Ala Asn Val He Arg He Met Arg Lys 
50 55 60 65 



7 



ate ctt cct cca cat gec aag ate tct gat gat gec aaa gag acg ate 3 00 

lie Leu Pro Pro His Ala Lys lie Ser Asp Asp Ala Lys Glu Thr lie 
70 75 80 

caa gaa tgt gtt tea gag tac att age ttt gtc aca ggc gag gca aat 34 8 

Gin Glu Cys Val Ser Glu Tyr lie Ser Phe Val Thr Gly Glu Ala Asn 
85 90 95 

gac cgc tgc cag cgt gag caa agg aag ace ate aca get gaa gat gtg 3 96 

Asp Arg Cys Gin Arg Glu Gin Arg Lys Thr lie Thr Ala Glu Asp Val 
100 105 110 

etc tgg get atg age aaa ctg gga ttt gat gat tat ate gag ccc ttg 444 
Leu Trp Ala Met Ser Lys Leu Gly Phe Asp Asp Tyr lie Glu Pro Leu 
115 120 125 

act gtg tat etc cat cgc tac agg gag ttt gat ggt ggc gaa cgt gga 492 
Thr Val Tyr Leu His Arg Tyr Arg Glu Phe Asp Gly Gly Glu Arg Gly 
130 135 140 145 

tec ata agg ggt gag ccc ctt gtg aag agg agt act tct gat cct ggt 54 0 

Ser lie Arg Gly Glu Pro Leu Val Lys Arg Ser Thr Ser Asp Pro Gly 
150 155 160 

cac ttt ggg atg get tct ttt gtg cct get ttt cat atg ggt cat cat 58 8 

His Phe Gly Met Ala Ser Phe Val Pro Ala Phe His Met Gly His His 
165 170 175 

aac ggc ttc ttt ggt cct gca age att ggt ggt ttc ctg aaa gac cca 63 6 

Asn Gly Phe Phe Gly Pro Ala Ser lie Gly Gly Phe Leu Lys Asp Pro 
180 185 190 

teg agt get ggc cct teg gga cct gca gtc get ggg ttt gag ccg tat 684 
Ser Ser Ala Gly Pro Ser Gly Pro Ala Val Ala Gly Phe Glu Pro Tyr 
195 200 205 

get cag tgt aaa gag taactgcaaa aagtaggggt tgggatgaga tgatgatgat 73 9 

Ala Gin Cys Lys Glu 

210 

ggtggtggtg gtggtggttt gttttgtttt gttctttctt ttttttttct tctttctttt 799 

cttggtcatt gaggaacaaa cttacattgg ttcactttgg ctaggcatgt aaacggttaa 8 59 

catgettate aagtagtagt tttcgatcaa aaaaaaaaaa aaaaaa 905 

<210> 12 
<211> 214 
<212> PRT 

<213> Veronia mespilif olia 
<400> 12 

Met Glu Arg Gly Gly Gly Phe His Gly Tyr His Arg Leu Pro lie His 

15 10 15 

Pro Thr Ser Gly lie Gin Gin Ser Asp Met Lys Leu Lys Leu Pro Glu 

20 25 30 

Met Thr Asn Asn Asn Ser Ser Thr Asp Asp Asn Glu Cys Thr Val Arg 

35 40 45 

Glu Gin Asp Arg Phe Met Pro lie Ala Asn Val lie Arg lie Met Arg 

50 55 60 

Lys lie Leu Pro Pro His Ala Lys lie Ser Asp Asp Ala Lys Glu Thr 
65 70 75 80 

lie Gin Glu Cys Val Ser Glu Tyr lie Ser Phe Val Thr Gly Glu Ala 
85 90 95 



8 



Asn Asp Arg Cys Gin Arg Glu Gin Arg Lys Thr lie Thr Ala Glu Asp 

100 105 110 

Val Leu Trp Ala Met Ser Lys Leu Gly Phe Asp Asp Tyr lie Glu Pro 

115 120 125 

Leu Thr Val Tyr Leu His Arg Tyr Arg Glu Phe Asp Gly Gly Glu Arg 

130 135 140 

Gly Ser lie Arg Gly Glu Pro Leu Val Lys Arg Ser Thr Ser Asp Pro 
145 150 155 160 

Gly His Phe Gly Met Ala Ser Phe Val Pro Ala Phe His Met Gly His 

165 170 175 

His Asn Gly Phe Phe Gly Pro Ala Ser lie Gly Gly Phe Leu Lys Asp 

180 185 190 

Pro Ser Ser Ala Gly Pro Ser Gly Pro Ala Val Ala Gly Phe Glu Pro 

195 200 205 

Tyr Ala Gin Cys Lys Glu 
210 

<210> 13 
<211> 763 
<212> DNA 
<213> Zea mays 

<220> 

<221> CDS 

<222> (1) . . . (480) 

<400> 13 

gca cga ggc aag acc gtc acc tec gag gac ate gtg tgg gec atg age 48 
Ala Arg Gly Lys Thr Val Thr Ser Glu Asp lie Val Trp Ala Met Ser 
15 10 15 

cgc etc ggc ttc gac gac tac gtc gcg ccc etc ggc gec ttc etc cag 96 
Arg Leu Gly Phe Asp Asp Tyr Val Ala Pro Leu Gly Ala Phe Leu Gin 
20 25 30 

cgc atg cgc gac gac age gac cac ggc ggt gaa gag cgc ggc ggc cct 144 
Arg Met Arg Asp Asp Ser Asp His Gly Gly Glu Glu Arg Gly Gly Pro 
35 40 45 

gca ggg cgt ggt ggc teg cgc cgc ggc teg teg tec ttg ccg etc cac 192 
Ala Gly Arg Gly Gly Ser Arg Arg Gly Ser Ser Ser Leu Pro Leu His 
50 55 60 

tgc ccg cag cag atg cac cac ctg cac cca gec gtc tgc egg cgt ccg 24 0 

Cys Pro Gin Gin Met His His Leu His Pro Ala Val Cys Arg Arg Pro 
65 70 75 80 

cac cag age gtg teg cct get gca gga tac gee gtc egg ccc gtt ccc 288 
His Gin Ser Val Ser Pro Ala Ala Gly Tyr Ala Val Arg Pro Val Pro 
85 90 95 

cgc ccg atg cca gee cgt ggg tac cgc atg cag ggc gga gac cac cgc 336 
Arg Pro Met Pro Ala Arg Gly Tyr Arg Met Gin Gly Gly Asp His Arg 
100 105 110 

age gtg ggc ggc gtg get ccc tgc age tac gga ggg gcg etc gtc cag 384 
Ser Val Gly Gly Val Ala Pro Cys Ser Tyr Gly Gly Ala Leu Val Gin 
115 120 125 

gee ggt gga acc caa cac gtt gtt gga ttc cac gac gac gag gca age 432 
Ala Gly Gly Thr Gin His Val Val Gly Phe His Asp Asp Glu Ala Ser 
130 135 140 



9 



tct teg agt gaa aat ccg ccg ccg gag ggg cgt gec get ggc teg aac 48 0 

Ser Ser Ser Glu Asn Pro Pro Pro Glu Gly Arg Ala Ala Gly Ser Asn 
145 150 155 160 

tagectaget tctcagttcc ccgtgtacaa taagaggggc ggtcgcggcg ccgcgccgcg 540 

cccttgggtt gggccgggcg etatgetgea gtttggtttg taaactaacg agectagggt 600 

agctggtgca cgcgcgccac ctcgccggac gtcgccgtcg tegteggcat ggacttaacc 660 

ggcgggccct gttgttattt ctcaagtttg tagccaacgc actgttcggt gcgttccata 720 

atttaattta ccatgttgct ctcgaaaaaa aaaaaaaaaa aaa 763 

<210> 14 
<211> 160 
<212> PRT 
<213> Zea mays 





<400> 


14 


























Ala 


Arg 


Gly 


Lys 


Thr 


Val 


Thr 


Ser 


Glu 


Asp 


lie 


Val 


Trp 


Ala 


Met 


Ser 


1 








5 










10 










15 




Arg 


Leu 


Gly 


Phe 


Asp 


Asp 


Tyr 


Val 


Ala 


Pro 


Leu 


Gly 


Ala 


Phe 


Leu 


Gin 








20 










25 










30 






Arg 


Met 


Arg 


Asp 


Asp 


Ser 


Asp 


His 


Gly 


Gly Glu 


Glu 


Arg 


Gly Gly 


Pro 






35 










40 










45 








Ala 


Gly 


Arg 


Gly 


Gly 


Ser 


Arg 


Arg 


Gly 


Ser 


Ser 


Ser 


Leu 


Pro 


Leu 


His 




50 










55 










60 










Cys 


Pro 


Gin 


Gin 


Met 


His 


His 


Leu 


His 


Pro 


Ala 


Val 


Cys 


Arg 


Arg 


Pro 


65 










70 










75 










80 


His 


Gin 


Ser 


Val 


Ser 


Pro 


Ala 


Ala 


Gly Tyr 


Ala 


Val 


Arg 


Pro 


Val 


Pro 










85 










90 










95 




Arg 


Pro 


Met 


Pro 


Ala 


Arg 


Gly Tyr 


Arg 


Met 


Gin 


Gly 


Gly Asp 


His 


Arg 








100 










105 










110 






Ser 


Val 


Gly 


Gly 


Val 


Ala 


Pro 


Cys 


Ser 


Tyr 


Gly 


Gly 


Ala 


Leu 


Val 


Gin 






115 










120 










125 








Ala 


Gly 


Gly 


Thr 


Gin 


His 


Val 


Val 


Gly 


Phe 


His 


Asp 


Asp 


Glu 


Ala 


Ser 




130 










135 










14 0 










Ser 


Ser 


Ser 


Glu 


Asn 


Pro 


Pro 


Pro 


Glu 


Gly Arg Ala 


Ala 


Gly 


Ser 


Asn 


145 










150 










155 










160 



<210> 15 
<211> 622 
<212> DNA 
<213> Zea mays 

<220> 

<221> CDS 

<222> (3) . . . (622) 

<221> misc_f eature 
<222> (1) . . . (622) 
<223> n = A,T,C or G 

<400> 15 

gc atg aat aat ccc caa aac cct aaa gec agt get cct tgc ace ttg 47 
Met Asn Asn Pro Gin Asn Pro Lys Ala Ser Ala Pro Cys Thr Leu 
15 10 15 

cca ccg gag ctt ccc aaa gaa gca gtg gcg ace gac gaa gca ccg ccg 95 
Pro Pro Glu Leu Pro Lys Glu Ala Val Ala Thr Asp Glu Ala Pro Pro 
20 25 30 

cca atg ggc aac aac aac aac acg gaa teg gcg acg gcg acg atg gtc 143 
Pro Met Gly Asn Asn Asn Asn Thr Glu Ser Ala Thr Ala Thr Met Val 



10 



35 



40 



45 



egg gag cag gac egg ctg atg ccc gtg gec aac gtg tec cgc ate atg 191 
Arg Glu Gin Asp Arg Leu Met Pro Val Ala Asn Val Ser Arg lie Met 
50 55 60 

cgc caa gtg ctg cct ccg tac gec aag ate tec gac gac gee can gaa 23 9 

Arg Gin Val Leu Pro Pro Tyr Ala Lys lie Ser Asp Asp Ala Xaa Glu 
65 70 75 

gtn ate caa gaa ttg ctn ttc gga att tea tea ctt ncg tec tgg cga 28 7 

Xaa lie Gin Glu Leu Xaa Phe Gly lie Ser Ser Leu Xaa Ser Trp Arg 
80 85 90 95 

ggc gaa acg aag egg tgc cac acc gag cgc cgc aag acc gtc acc tec 335 
Gly Glu Thr Lys Arg Cys His Thr Glu Arg Arg Lys Thr Val Thr Ser 
100 105 110 

gaa gac ate gtg tgg gec atg age cgc etc ggc ttc gac gac tac gtc 3 83 

Glu Asp lie Val Trp Ala Met Ser Arg Leu Gly Phe Asp Asp Tyr Val 
115 120 125 

gcg ccc etc ggc gee ttc etc cag cgc atg cgc gac nac age gaa cac 431 
Ala Pro Leu Gly Ala Phe Leu Gin Arg Met Arg Asp Xaa Ser Glu His 
130 135 140 

ggg ggt gaa aac gcg gcg gee tgc ang ggg tng tgg ten cgc cgc ggg 47 9 

Gly Gly Glu Asn Ala Ala Ala Cys Xaa Gly Xaa Trp Xaa Arg Arg Gly 
145 150 155 

teg tct net tgg cgc tec ctt gee gca ana gat gac aac ttg cac caa 52 7 

Ser Ser Xaa Trp Arg Ser Leu Ala Ala Xaa Asp Asp Asn Leu His Gin 
160 165 170 175 

acg tct gee ggg ntc gga cca aaa ctn ttc cct gtt gca gga ata ccc 575 
Thr Ser Ala Gly Xaa Gly Pro Lys Xaa Phe Pro Val Ala Gly lie Pro 
180 185 190 

gtc cng ggc cnt tec ccc ccn aat cca acc att tgg ttt ccc ctt gc 622 
Val Xaa Gly Xaa Ser Pro Xaa Asn Pro Thr lie Trp Phe Pro Leu 
195 200 205 



<210> 16 
<211> 206 
<212> PRT 
<213> Zea mays 

<220> 

<221> VARIANT 

<222> {!)... (206) 

<223> Xaa = Any Amino Acid 



Thr Leu Pro 
15 

Pro Pro Pro 
30 

Met Val Arg 
lie Met Arg 



<400> 16 
Met Asn Asn Pro Gin Asn Pro 

1 5 
Pro Glu Leu Pro Lys Glu Ala 
20 

Met Gly Asn Asn Asn Asn Thr 
35 

Glu Gin Asp Arg Leu Met Pro 
50 55 



Lys Ala Ser Ala Pro Cys 
10 

Val Ala Thr Asp Glu Ala 
25 

Glu Ser Ala Thr Ala Thr 
40 45 
Val Ala Asn Val Ser Arg 
60 

11 



Gin 


Val 


Leu 


Pro 


Pro 


Tyr 


Ala 


Lys 


He 


Ser 


Asp 


Asp 


Ala 


Xaa 


Glu 


Xaa 












70 










75 










80 


lie 


Gin 


Glu 


Leu 


Xaa 
8 5 


Phe 


Glv 


He 


Ser 


Ser 
90 


Leu 


Xaa 


Ser 


Trp 


Arg 
95 


Gly 


Glu 


Thr 


Lys 


Arcr 
100 


Cys 


His 


Thr 


Glu 


Arg 
105 


Arg 


Lys 


Thr 


Val 


Thr 
110 


Ser 


Glu 


Asp 


He 


Val 

11 c 
J. _LO 


Trp 


Ala 


Met 


Ser 


Arg 
12 0 


Leu 


Gly 


Phe 


Asp 


Asp 
125 


Tyr 


Val 


Ala 


Pro 


Leu 


Gly 


Ala 


Phe 


Leu 


Gin 


Arg 


Met 


Arg Asp 


Xaa 


Ser 


Glu 


His 


Gly 




Xj U 










135 










140 










Gly 


Glu 


Asn 


Ala 


Ala 


Ala 


Cys 


Xaa 


Gly 


Xaa 


Trp 


Xaa 


Arg 


Arg 


Gly 


Ser 












i en 

J- 3 U 










155 










160 


Ser 


Xaa 


Trp 


Arg 


Ser 
165 


Leu 


Ala 


Ala 


Xaa 


Asp 
170 


Asp 


Asn 


Leu 


His 


Gin 
175 


Thr 


Ser 


Ala 


Gly 


Xaa 
180 


Gly 


Pro 


Lys 


Xaa 


Phe 
185 


Pro 


Val 


Ala 


Gly 


He 
190 


Pro 


Val 


Xaa 


Gly 


Xaa 
195 


Ser 


Pro 


Xaa 


Asn 


Pro 
200 


Thr 


He 


Trp 


Phe 


Pro 
205 


Leu 







<210> 17 

<211> 1121 

<212> DNA 

<213> Glycine max 

<220> 

<221> CDS 

<222> (3) . . . (1121) 

<400> 17 

gc acg agg gaa act gga ggc ttt cat ggc tac cgc aag etc ccc aac 4 7 

Thr Arg Glu Thr Gly Gly Phe His Gly Tyr Arg Lys Leu Pro Asn 
15 10 15 

aca acc tct ggg ttg aag ctg tea gtg tea gac atg aac atg aac atg 95 
Thr Thr Ser Gly Leu Lys Leu Ser Val Ser Asp Met Asn Met Asn Met 
20 25 30 

agg cag cag cag gta gca tea tea gat cag aac tgc age aac cac agt 143 
Arg Gin Gin Gin Val Ala Ser Ser Asp Gin Asn Cys Ser Asn His Ser 
35 40 45 

gca gca gga gag gag aac gaa tgc acg gtg agg gag caa gac agg ttc 191 
Ala Ala Gly Glu Glu Asn Glu Cys Thr Val Arg Glu Gin Asp Arg Phe 
50 55 60 

atg cca ate get aac gtg ata egg ate atg cgc aag att etc cct cca 239 
Met Pro He Ala Asn Val He Arg He Met Arg Lys He Leu Pro Pro 
65 70 75 

cac gca aaa ate tec gat gat gca aag gag aca ate caa gag tgc gtg 28 7 

His Ala Lys He Ser Asp Asp Ala Lys Glu Thr He Gin Glu Cys Val 
80 85 90 95 

teg gag tac ate age ttc ate acc ggg gag gec aac gag cgt tgc cag 335 
Ser Glu Tyr He Ser Phe He Thr Gly Glu Ala Asn Glu Arg Cys Gin 
100 105 110 

agg gag cag cgc aag acc ata acc gca gag gac gtg ctt tgg gca atg 383 
Arg Glu Gin Arg Lys Thr He Thr Ala Glu Asp Val Leu Trp Ala Met 
115 120 125 

agt aag ctt gga ttc gac gac tac ate gaa ccg tta acc atg tac ctt 431 
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Ser Lys Leu Gly Phe Asp Asp Tyr lie Glu Pro Leu Thr Met Tyr Leu 
130 135 140 



cac cgc tac cgt gag ctg gag ggt gac cgc acc tct atg agg ggt gaa 47 9 

His Arg Tyr Arg Glu Leu Glu Gly Asp Arg Thr Ser Met Arg Gly Glu 
145 150 155 

ccg etc ggg aag agg act gtg gaa tat gec acg ctt get act get ttt 52 7 

Pro Leu Gly Lys Arg Thr Val Glu Tyr Ala Thr Leu Ala Thr Ala Phe 
160 165 170 175 

gtg ccg cca ccc ttt cat cac cac aat ggc tac ttt ggt get gec atg 575 
Val Pro Pro Pro Phe His His His Asn Gly Tyr Phe Gly Ala Ala Met 
180 185 190 

ccc atg ggg act tac gtt agg gaa acg cca cca aat get gcg tea tct 623 
Pro Met Gly Thr Tyr Val Arg Glu Thr Pro Pro Asn Ala Ala Ser Ser 
195 200 205 

cat cac cat cat gga ate tec aat get cat gaa cca aat get cgc tec 671 
His His His His Gly lie Ser Asn Ala His Glu Pro Asn Ala Arg Ser 
210 215 220 

ata taa aat taa tga aga gta ctg ttc agt agg aga aca aga ctt ctt 719 
lie * Asn * * Arg Val Leu Phe Ser Arg Arg Thr Arg Leu Leu 
225 230 235 

gga ctt gat tag ctt aac tct cag tga ttg gtg tta gag tac tgt tgt 767 
Gly Leu Asp * Leu Asn Ser Gin * Leu Val Leu Glu Tyr Cys Cys 
240 245 250 

tga gga tgg tta att tta taa tta agg get ggg aat tgg gga gtt agt 815 

* Gly Trp Leu lie Leu * Leu Arg Ala Gly Asn Trp Gly Val Ser 

255 260 

ata tat tec taa tec taa tta tgt gca tct tta att tat gga ata act 863 

lie Tyr Ser * Ser * Leu Cys Ala Ser Leu lie Tyr Gly lie Thr 

265 270 275 

ttg ttt ttt gtt tta act tct gat aat ttg gat ttt ctg atg ttt aat 911 

Leu Phe Phe Val Leu Thr Ser Asp Asn Leu Asp Phe Leu Met Phe Asn 

280 285 290 

gtg gtt ttg tct ate cct tat taa cag tgc caa get taa ggt ttt age 959 

Val Val Leu Ser lie Pro Tyr * Gin Cys Gin Ala * Gly Phe Ser 
295 300 305 

cat get cca aaa tgg aat act tgt act gtt atg ttg ttc tgg tag tga 1007 

His Ala Pro Lys Trp Asn Thr Cys Thr Val Met Leu Phe Trp * * 
310 315 320 

tgg tg a tga aac ct g caa gtt at g ttt atg tat aaa gec act att gat 1055 
Trp * * Asn Leu Gin Val Met Phe Met Tyr Lys Ala Thr lie Asp 
325 330 335 

caa aat tag aga aat tat cat tta ata agt ate etc cca tgt taa ttt 1103 
Gin Asn * Arg Asn Tyr His Leu lie Ser lie Leu Pro Cys * Phe 
340 345 350 

taa aaa aaa aaa aaa aaa 1121 

* Lys Lys Lys Lys Lys 

355 
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<210> 18 

<211> 355 

<212> PRT 

<213> Glycine max 





<400> 


1 o 

lo 


Thr 


Arg 


Glu 


Thr 


1 








Thr 


Ser 


Gly 


Leu 








*"> A 


Gin 


Gin 


Gin 


Val 






35 




Ala 


Gly Glu 


Glu 




50 






Pro 


He 


Ala 


Asn 


65 








Ala 


Lys 


He 


Ser 


Glu 


Tyr 


He 


Ser 








100 


Glu 


Gin 


Arg 


Lys 






115 




Lys 


Leu 


Gly 


Phe 




130 






Arg 


Tyr 


Arg 


Glu 


145 








Leu 


Gly Lys 


Arg 


Pro 


Pro 


Pro 


Phe 








180 


Met 


Gly 


Thr 


Tyr 






195 




His 


His 


His 


Gly 




210 






Asn 


Arg 


Val 


Leu 


225 








Asn 


Ser 


Gin 


Leu 


Arg 


Ala 


Gly 


Asn 








260 


Leu 


He 


Tyr 


Gly 






275 




Asp 


Phe 


Leu 


Met 




290 






Ala 


Gly 


Phe 


Ser 


305 








Phe 


Trp 


Trp 


Asn 


Gin 


Asn 


Arg 


Asn 








340 


Lys 


Lys 


Lys 








355 





Gly Gly Phe His 
5 

Lys Leu Ser Val 

Ala Ser Ser Asp 
40 

Asn Glu Cys Thr 
55 

Val He Arg He 
70 

Asp Asp Ala Lys 
85 

Phe He Thr Gly 

Thr He Thr Ala 
120 

Asp Asp Tyr He 
135 

Leu Glu Gly Asp 
150 

Thr Val Glu Tyr 
165 

His His His Asn 

Val Arg Glu Thr 
200 

He Ser Asn Ala 
215 

Phe Ser Arg Arg 
230 

Val Leu Glu Tyr 
245 

Trp Gly Val Ser 

He Thr Leu Phe 
280 

Phe Asn Val Val 
295 

His Ala Pro Lys 
310 

Leu Gin Val Met 
325 

Tyr His Leu He 



Gly Tyr Arg Lys 
10 

Ser Asp Met Asn 
25 

Gin Asn Cys Ser 

Val Arg Glu Gin 
60 

Met Arg Lys He 
75 

Glu Thr He Gin 
90 

Glu Ala Asn Glu 
105 

Glu Asp Val Leu 

Glu Pro Leu Thr 
140 

Arg Thr Ser Met 
155 

Ala Thr Leu Ala 
170 

Gly Tyr Phe Gly 
185 

Pro Pro Asn Ala 

His Glu Pro Asn 
220 

Thr Arg Leu Leu 
235 

Cys Cys Gly Trp 
250 

He Tyr Ser Ser 
265 

Phe Val Leu Thr 

Leu Ser He Pro 
300 

Trp Asn Thr Cys 
315 

Phe Met Tyr Lys 
330 

Ser He Leu Pro 
345 



Leu Pro Asn Thr 
15 

Met Asn Met Arg 
30 

Asn His Ser Ala 
45 

Asp Arg Phe Met 

Leu Pro Pro His 
80 

Glu Cys Val Ser 
95 

Arg Cys Gin Arg 
110 

Trp Ala Met Ser 
125 

Met Tyr Leu His 

Arg Gly Glu Pro 
160 

Thr Ala Phe Val 
175 

Ala Ala Met Pro 
190 

Ala Ser Ser His 
205 

Ala Arg Ser He 

Gly Leu Asp Leu 
240 

Leu He Leu Leu 
255 

Leu Cys Ala Ser 
270 

Ser Asp Asn Leu 
285 

Tyr Gin Cys Gin 

Thr Val Met Leu 
320 

Ala Thr He Asp 
335 

Cys Phe Lys Lys 
350 



<210> 19 

<211> 796 

<212> DNA 

<213> Glycine max 



<220> 
<221> CDS 



14 



<222> (1) . . . (513) 



<400> 19 

gca cga gca atg gcg gga gtg agg gaa cag gac cag tac atg ccg ata 

Ala Arg Ala Met Ala Gly Val Arg Glu Gin Asp Gin Tyr Met Pro He 

15 10 15 

gcg aac gtg ata agg ate atg cgt egg att ctg cca gcg cac gcg aag 

Ala Asn Val He Arg He Met Arg Arg He Leu Pro Ala His Ala Lys 

20 25 30 



48 



96 



ate tea gac gac gcg aag gag acg ate cag gag tgc gtg tct gag tac 144 
He Ser Asp Asp Ala Lys Glu Thr He Gin Glu Cys Val Ser Glu Tyr 
35 40 45 

ate agt ttc ate acg gcg gag gcg aac gag egg tgc cag egg gag cag 192 
He Ser Phe He Thr Ala Glu Ala Asn Glu Arg Cys Gin Arg Glu Gin 
50 55 60 

egg aag acg gtg acc gca gag gat gtg ttg tgg gcg atg gag aag ctt 24 0 

Arg Lys Thr Val Thr Ala Glu Asp Val Leu Trp Ala Met Glu Lys Leu 
65 70 75 80 

ggc ttt gac aac tac get cac cct etc tct ctt tac ctt cac cgc tac 288 
Gly Phe Asp Asn Tyr Ala His Pro Leu Ser Leu Tyr Leu His Arg Tyr 
85 90 95 

cgc gag agt gaa gga gaa cct get tct gtc aga cgc get tct tct gca 336 
Arg Glu Ser Glu Gly Glu Pro Ala Ser Val Arg Arg Ala Ser Ser Ala 
100 105 110 

atg ggg ate aat aat aat atg gtg cac cca cct tat att aat tct cat 384 
Met Gly He Asn Asn Asn Met Val His Pro Pro Tyr He Asn Ser His 
115 120 125 

ggc ttt gga atg ttt gat ttt gac cca tea teg caa ggg ttt tac agg 432 
Gly Phe Gly Met Phe Asp Phe Asp Pro Ser Ser Gin Gly Phe Tyr Arg 
130 135 140 

gac gat cat aac get get tct gga tct ggt ggt ttt gtt gcg cct ttt 480 
Asp Asp His Asn Ala Ala Ser Gly Ser Gly Gly Phe Val Ala Pro Phe 
145 150 155 160 

gat cct tat get aac ate aaa cgt gat gee ctg tgatcatgta agaacaacaa 533 
Asp Pro Tyr Ala Asn He Lys Arg Asp Ala Leu 
165 170 

ctagtgcatg ctgettttte acttggttag ttatattcaa gcacaagcac atgcaggtgc 5 93 

agetgeaact atttagcttc atctacaaat cttttttcct ctcttcttct catgetttaa 6 53 

ttatttagag acaatacttg ttattcattg ttatgctcaa ttgetagett ctattcatcg 713 

tcgactgtct gtattgttga tgttcattac agtaacagat aagatggtaa ctgetttact 773 

acttcaaaaa aaaaaaaaaa aaa 796 

<210> 20 

<211> 171 

<212> PRT 

<213> Glycine max 

<400> 20 

Ala Arg Ala Met Ala Gly Val Arg Glu Gin Asp Gin Tyr Met Pro He 

15 10 15 

Ala Asn Val He Arg He Met Arg Arg He Leu Pro Ala His Ala Lys 

15 



20 25 30 

lie Ser Asp Asp Ala Lys Glu Thr lie Gin Glu Cys Val Ser Glu Tyr 

35 40 45 

lie Ser Phe lie Thr Ala Glu Ala Asn Glu Arg Cys Gin Arg Glu Gin 

50 55 60 

Arg Lys Thr Val Thr Ala Glu Asp Val Leu Trp Ala Met Glu Lys Leu 
65 70 75 80 

Gly Phe Asp Asn Tyr Ala His Pro Leu Ser Leu Tyr Leu His Arg Tyr 

85 90 95 

Arg Glu Ser Glu Gly Glu Pro Ala Ser Val Arg Arg Ala Ser Ser Ala 

100 105 110 

Met Gly lie Asn Asn Asn Met Val His Pro Pro Tyr lie Asn Ser His 

115 120 125 

Gly Phe Gly Met Phe Asp Phe Asp Pro Ser Ser Gin Gly Phe Tyr Arg 

130 135 140 

Asp Asp His Asn Ala Ala Ser Gly Ser Gly Gly Phe Val Ala Pro Phe 
145 150 155 160 

Asp Pro Tyr Ala Asn lie Lys Arg Asp Ala Leu 
165 170 

<210> 21 
<211> 1098 
<212> DNA 

<213> Triticum aestivum 

<220> 

<221> CDS 

<222> (55) . . . (894) 

<400> 21 

gcacgagcaa gtgcgagtgc gactacctgc attgcacctt ggctagccct agac atg 57 

Met 
1 

gag aac gac ggc gtc ccc aac gga cca gcg gcg ccg gca cct acc cag 105 
Glu Asn Asp Gly Val Pro Asn Gly Pro Ala Ala Pro Ala Pro Thr Gin 
5 10 15 

ggg acg ccg gtg gtg egg gag cag gac egg ctg atg ccg ate gcg aac 153 
Gly Thr Pro Val Val Arg Glu Gin Asp Arg Leu Met Pro lie Ala Asn 
20 25 30 

gtg ate cgc ate atg cgc cgt gcg etc cct gee cac gee aag ate tec 2 01 

Val lie Arg lie Met Arg Arg Ala Leu Pro Ala His Ala Lys lie Ser 
35 40 45 

gac gac gec aag gag gcg att cag gaa tgc gtg tec gag ttc ate age 24 9 

Asp Asp Ala Lys Glu Ala lie Gin Glu Cys Val Ser Glu Phe lie Ser 
50 55 60 65 

ttc gtc acc ggc gag gee aac gaa egg tgc cgc atg cag cac cgc aag 2 97 

Phe Val Thr Gly Glu Ala Asn Glu Arg Cys Arg Met Gin His Arg Lys 
70 75 80 

acc gtc aac gee gaa gac ate gtg tgg gee eta aac cgc etc ggc ttc 345 
Thr Val Asn Ala Glu Asp lie Val Trp Ala Leu Asn Arg Leu Gly Phe 
85 90 95 

gac gac tac gtc gtg ccc etc age gtc ttc ctg cac cgc atg cgc gac 393 
Asp Asp Tyr Val Val Pro Leu Ser Val Phe Leu His Arg Met Arg Asp 
100 105 110 
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ccc gag gcg ggg aca ggt ggt gcc get gca ggc gac age cgc gec gtg 441 
Pro Glu Ala Gly Thr Gly Gly Ala Ala Ala Gly Asp Ser Arg Ala Val 
115 120 125 

acg agt gcg cct ccc cgc gcg gcc ccg ccc gtg ate cac gcc gtg ccg 4 89 

Thr Ser Ala Pro Pro Arg Ala Ala Pro Pro Val lie His Ala Val Pro 
130 135 140 145 

ctg cag get cag cgc ccg atg tac gcg ccc ccg get ccg ttg cag gtt 53 7 

Leu Gin Ala Gin Arg Pro Met Tyr Ala Pro Pro Ala Pro Leu Gin Val 
150 155 160 

gag aat cag atg cag egg cct gtg tac get ccc ccg get ccg gtg cag 585 
Glu Asn Gin Met Gin Arg Pro Val Tyr Ala Pro Pro Ala Pro Val Gin 
165 170 175 

gtt cag atg cag egg ggc ate tat ggg ccc egg get cca gtg cac ggg 6 33 

Val Gin Met Gin Arg Gly lie Tyr Gly Pro Arg Ala Pro Val His Gly 
180 185 190 

tac gcc gtc gga atg gcg ccc gtg egg gcc aac gtc ggc ggg cag tac 681 
Tyr Ala Val Gly Met Ala Pro Val Arg Ala Asn Val Gly Gly Gin Tyr 
195 200 205 

cag gtg ttc ggc gga gag ggt gtc atg gcc cag caa tac tac ggg tac 729 
Gin Val Phe Gly Gly Glu Gly Val Met Ala Gin Gin Tyr Tyr Gly Tyr 
210 215 220 225 

ggg tac gag gaa gga gcg tac ggc gca ggt age age aac gga gga gcc 777 
Gly Tyr Glu Glu Gly Ala Tyr Gly Ala Gly Ser Ser Asn Gly Gly Ala 
230 235 240 

gcc att ggc gac gag gag age teg tec aac ggc gtg ccg gca ccg ggg 825 
Ala lie Gly Asp Glu Glu Ser Ser Ser Asn Gly Val Pro Ala Pro Gly 
245 250 255 

9 a 9" ggc atg ggg gag cca gag cca gag cca gca gca gaa gaa teg cat 8 73 

Glu Gly Met Gly Glu Pro Glu Pro Glu Pro Ala Ala Glu Glu Ser His 
260 265 270 

gac aag ccc gtc caa tct ggc tagtcgegtg cgcggcgcgc gttagcttct 924 
Asp Lys Pro Val Gin Ser Gly 
275 280 

gcgtcctgtg tactgtaata atttgeegtg tcgatccggc catggtttgt gtgtgcgtag 984 
tgcttatcta atgtgggctt gtcctctagt aattcatgta ttgettatet aatgtggact 1044 
tgtcctctag taattcatgt actctttget gttgaaaaaa aaaaaaaaaa aaaa 1098 

<210> 22 
<211> 280 
<212> PRT 

<213> Triticum aestivum 
<400> 22 

Met Glu Asn Asp Gly Val Pro Asn Gly Pro Ala Ala Pro Ala Pro Thr 

15 10 15 

Gin Gly Thr Pro Val Val Arg Glu Gin Asp Arg Leu Met Pro lie Ala 

20 25 30 

Asn Val lie Arg lie Met Arg Arg Ala Leu Pro Ala His Ala Lys lie 

35 40 45 

Ser Asp Asp Ala Lys Glu Ala lie Gin Glu Cys Val Ser Glu Phe lie 
50 55 60 
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Ser 


Phe 


Val 


Thr Gly 


Glu Ala 


Asn 


Glu 


Arg 


Cys 


Arg 


Met 


Gin 


His 


Arg 


65 










70 








75 










80 


Lys 


Thr 


Val 


Asn 


Ala 


Glu Asp 


He 


Val 


Trp 


Ala 


Leu 


Asn 


Arg 


Leu 


Gly 










85 








90 










95 




Phe 


Asp 


Asp 


Tvr 


Val 


Val Pro 


Leu 


Ser 


Val 


Phe 


Leu 


His 


Arg 


Met 


Arg 








100 








105 










110 






Asp 


Pro 


Glu 


Ala 


Glv 


Thr Gly 


Gly 


Ala 


Ala 


Ala 


Gly 


Asp 


Ser 


Arg 


Ala 






115 








120 










125 








Val 


Thr 


Ser 


Ala 


Pro 


Pro Arg 


Ala 


Ala 


Pro 


Pro 


Val 


He 


His 


Ala 


Val 




130 








135 










140 










Pro 


Leu 


Gin 


Ala 


Gin 


Arg Pro 


Met 


Tyr 


Ala 


Pro 


Pro 


Ala 


Pro 


Leu 


Gin 


145 










150 








155 










160 


Val 


Glu 


Asn 


Gin 


Met 


Gin Arg 


Pro 


Val 


Tyr 


Ala 


Pro 


Pro 


Ala 


Pro 


Val 










165 








170 










175 




Gin 


Val 


Gin 


Met 


Gin 


Arg Gly 


He 


Tyr 


Gly 


Pro 


Arg 


Ala 


Pro 


Val 


His 








180 








185 










190 






Gly Tyr 


Ala 


Val 


Gly Met Ala 


Pro 


Val 


Arcr 


Ala 


Asn 


Val 


Gly 


Gly 


Gin 






195 








200 










205 








Tyr 


Gin 


Val 


Phe 


Gly Gly Glu 


Gly 


Val 


Met 


Ala 


Gin 


Gin 


Tyr 


Tyr 


Gly 




210 








215 










220 










Tyr 


Gly 


Tyr 


Glu 


Glu 


Gly Ala 


Tyr 


Gly 


Ala 


Gly 


Ser 


Ser 


Asn 


Gly 


Gly 


225 










230 








235 










240 


Ala 


Ala 


He 


Gly Asp 


Glu Glu 


Ser 


Ser 


Ser 


Asn 


Gly 


Val 


Pro 


Ala 


Pro 










245 








250 










255 




Gly Glu 


Gly 


Met 


Gly Glu Pro 


Glu 


Pro 


Glu 


Pro 


Ala 


Ala 


Glu 


Glu 


Ser 








260 








265 










270 






His 


Asp 


Lys 


Pro 


Val 


Gin Ser 


Gly 






















275 








280 




















<210> 


23 


























<211> 


65 


























<212> 


PRT 


























<213> 


Artificial Sequence 




















<220> 




























<223> 


LEC1 consensus protein sequence 














<221> 


VARIANT 
























<222> 


(1) 


. . . (65) 






















<223> 


Xaa 


- Any Amino Acid 


















<400> 


23 
























Arg 


Glu 


Gin 


Asp 


Xaa 


Xaa Met 


Pro 


He 


Ala 


Asn 


Val 


He 


Arg 


He 


Met 


1 








5 








10 










15 




Arg 


Xaa 


Xaa 


Leu 


Pro 


Xaa His 


Ala 


Lys 


He 


Ser 


Asp 


Asp 


Ala 


Lys 


Glu 








20 








25 










30 






Xaa 


He 


Gin 


Glu 


Cys 


Val Ser 


Glu 


Tyr 


He 


Ser 


Phe 


Xaa 


Thr 


Xaa 


Glu 






35 








40 










45 








Ala 


Asn 


Xaa 


Arg 


Cys 


Xaa Xaa 


Xaa 


Xaa 


Arg 


Lys 


Thr 


Xaa 


Xaa 


Xaa 


Glu 



50 55 60 

Xaa 
65 
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